期刊名称:Anuario del Seminario de Filología Vasca "Julio de Urquijo"
印刷版ISSN:0582-6152
出版年度:2010
卷号:0
期号:0
页码:41-56
语种:English
出版社:Anuario del Seminario de Filología Vasca "Julio de Urquijo"
摘要:This article surveys recent developments furthering dialectometric research which the authors have been involved in, in particular techniques for measuring large numbers of pronunciations (in phonetic transcription) of comparable words at various sites. Edit distance (also known as Levenshtein distance) has been deployed for this purpose, for which refinements and analytic techniques continue to be developed. The focus here is on (i) an empirical approach, using an information-theoretical measure of mutual information, for deriving the appropriate segment distances to serve within measures of sequence distance; (ii) a heuristic technique for simultaneously aligning large sets of comparable pronunciations, a necessary step in applying phylogenetic analysis to sound segment data; (iii) spectral clustering, a technique borrowed from bio-informatics, for identifying the (linguistic) features responsible for (dialect) divisions among sites; (iv) techniques for studying the (mutual) comprehensibility of closely related varieties; and (v) Séguy’s law, or the generality of sub-linear diffusion of aggregate linguistic variation.