Using computational criteria to extract large Swadesh lists for lexicostatistics

Dellert, Johannes; Buch, Armin

dc.contributor.author	Dellert, Johannes
dc.contributor.author	Buch, Armin
dc.date.accessioned	2016-03-02T09:49:03Z
dc.date.available	2016-03-02T09:49:03Z
dc.date.issued	2016-03-02
dc.identifier.other	457035439	de_DE
dc.identifier.uri	http://hdl.handle.net/10900/68640
dc.identifier.uri	http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-686406	de_DE
dc.identifier.uri	http://dx.doi.org/10.15496/publikation-10058
dc.identifier.uri	http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-686408	de_DE
dc.identifier.uri	http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-686404	de_DE
dc.description.abstract	We propose a new method for empirically determining lists of basic concepts for the purpose of compiling extensive lexicostatistical databases. The idea is to approximate a notion of “swadeshness” formally and reproducibly, without expert knowledge or bias, and to be able to rank any number of concepts given enough data. Unlike previous approaches, our procedure indirectly measures both the stability of concepts against lexical replacement and their proneness to phenomena such as onomatopoeia and extensive borrowing. The method provides a fully automated way to generate customized Swadesh lists of any desired length, possibly adapted to a given geographical region. We apply the method to a large lexical database of Northern Eurasia, deriving a swadeshness ranking for more than 5,000 concepts expressed by German lemmas. To validate the method, we evaluate this ranking against existing shorter lists of basic concepts, and we give an English version of the top 300 concepts according to this ranking.	en
dc.language.iso	en	de_DE
dc.publisher	Universität Tübingen	de_DE
dc.rights	ubt-podok	de_DE
dc.rights.uri	http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=de	de_DE
dc.rights.uri	http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=en	en
dc.subject.classification	Sprachstatistik, Phylogenetik	de_DE
dc.subject.ddc	400	de_DE
dc.subject.other	Lexicostatistics	en
dc.subject.other	Swadesh lists	en
dc.subject.other	phylogenetic linguistics	en
dc.title	Using computational criteria to extract large Swadesh lists for lexicostatistics	en
dc.type	ConferencePaper	de_DE
utue.publikation.fachbereich	Allgemeine u. vergleichende Sprachwissenschaft	de_DE
utue.publikation.fakultaet	5 Philosophische Fakultät	de_DE
utue.opus.portal	CPAL	de_DE

Files:	Dellert_Buch.pdf 165. KB PDF
