dc.contributor.author |
Dellert, Johannes |
|
dc.contributor.author |
Buch, Armin |
|
dc.date.accessioned |
2016-03-02T09:49:03Z |
|
dc.date.available |
2016-03-02T09:49:03Z |
|
dc.date.issued |
2016-03-02 |
|
dc.identifier.other |
457035439 |
de_DE |
dc.identifier.uri |
http://hdl.handle.net/10900/68640 |
|
dc.identifier.uri |
http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-686406 |
de_DE |
dc.identifier.uri |
http://dx.doi.org/10.15496/publikation-10058 |
|
dc.description.abstract |
We propose a new method for empirically determining lists of basic concepts for the purpose of compiling extensive lexicostatistical databases. The idea is to approximate a notion of “swadeshness” formally and reproducibly, without expert knowledge or bias, and to be able to rank any number of concepts given enough data. Unlike previous approaches, our procedure indirectly measures both the stability of concepts against lexical replacement and their proneness to phenomena such as onomatopoeia and extensive borrowing. The method provides a fully automated way to generate customized Swadesh lists of any desired length, possibly adapted to a given geographical region. We apply the method to a large lexical database of Northern Eurasia, deriving a swadeshness ranking for more than 5,000 concepts expressed by German lemmas. We evaluate this ranking against existing shorter lists of basic concepts to validate the method, and give an English version of the 300 top concepts according to this ranking. |
en |
dc.language.iso |
en |
de_DE |
dc.publisher |
Universität Tübingen |
de_DE |
dc.rights |
ubt-podok |
de_DE |
dc.rights.uri |
http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=de |
de_DE |
dc.rights.uri |
http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=en |
en |
dc.subject.classification |
Sprachstatistik, Phylogenetik |
de_DE |
dc.subject.ddc |
400 |
de_DE |
dc.subject.other |
Lexicostatistics |
en |
dc.subject.other |
Swadesh lists |
en |
dc.subject.other |
phylogenetic linguistics |
en |
dc.title |
Using computational criteria to extract large Swadesh lists for lexicostatistics |
en |
dc.type |
ConferencePaper |
de_DE |
utue.publikation.fachbereich |
Allgemeine u. vergleichende Sprachwissenschaft |
de_DE |
utue.publikation.fakultaet |
5 Philosophische Fakultät |
de_DE |
utue.opus.portal |
CPAL |
de_DE |