Exploring Tamil–Korean linguistic parallels: A computational and historical analysis of possible Pre-Hangul contact
Keywords:
comparative linguistics, computational linguistics Korean, maritime trade networks, TamilAbstract
This study investigates potential linguistic parallels between the Dravidian language Tamil and the Koreanic language Korean, evaluating whether observed similarities may reflect typological convergence, lexical coincidence, or indirect historical contact through maritime trade networks before the creation of Hangul in 1443 CE. The analysis compares phonological systems, consonant–vowel organization, morphological features, orthographic (letter stroke) patterns, and lexical correspondences, while applying computational string-similarity metrics to a dataset of 100 Korean–Tamil vocabulary pairs. Phonological inventories and historical script forms were compiled from established linguistic sources. Lexical similarity was measured using six computational models: Damerau–Levenshtein distance, Jaro similarity, Longest Common Subsequence (LCS), Cosine similarity, Jaccard similarity, and Ratcliff–Obershelp similarity. Hierarchical clustering (UPGMA) was employed to classify similarity levels among lexical pairs. The analysis produced the following scores: 0.5736 (Damerau–Levenshtein), 0.2255 (Jaro), 0.6001 (LCS), 0.4683 (Cosine), 0.3441 (Jaccard), and 0.5716 (Ratcliff–Obershelp), yielding an overall average similarity of approximately 56% across the dataset. Clustering results further identified groups of high, moderate, and low similarity. Both languages exhibit typological features commonly associated with agglutinative systems, including suffix-based morphology, Subject–Object–Verb (SOV) word order, and consonant–vowel syllable organization, as well as limited resemblances in orthographic stroke patterns.
Downloads
References
Acs, J., Hamerlik, E., Schwartz, R., Smith, N. A., & Kornai, A. (2024). Morphosyntactic probing of multilingual BERT models. Natural Language Engineering, 30, 753–792.
Arokiyaraj, S., Ravichandran, G., Chozhan, A., & Narayanan, K. (2021). Korean–Tamil language and cultural similarities, maritime trade between early historic Tamilakam and Korea. Shanlax International Journal of Arts, Science and Humanities, 8(3), 28–36.
Ashok, S. (2022). Stalin’s archaeology push in Tamil Nadu is the stuff of culture wars. ThePrint.
Batubara, N. A., & Widayati, D. (2022). Language kinship of English, German, and Dutch: A comparative historical linguistic study. International Journal of Humanities Education and Social Sciences, 1(6), 1016–1024.
Campbell, L. (2013). Historical linguistics: An introduction (3rd ed.). Edinburgh University Press.
Champakalakshmi, R. (1996). Trade, ideology and urbanization: South India 300 BC to AD 1300. Oxford University Press.
Clippinger, M. E. (1984). Korean and Dravidian: Lexical evidence for an old theory. Korean Studies, 8, 1–57.
Dayalan, D. (2013). Tamil Brahmi script on amphora sherd found at Khor Rori-Sumharam, Oman. Epigraphy of the Orient, 30, 146–148.
Dayalan, D. (2024a). Ancient seaports of Tamil Nadu and Kerala and their trade network. In A. Parasher Sen (Ed.), Handbook on urban history of early India. Springer.
Dayalan, D. (2024b). Cultural and trade links between India and Siam. Acta Via Serica, 9(1), 67–90.
Delmestri, A., & Cristianini, N. (2012). Linguistic phylogenetic inference by PAM-like matrices. Journal of Quantitative Linguistics, 19(2), 95–120.
Dinesh Kumar, M., Prasath, R., & Rajendran, P. (2018). Cross-language transliteration using string similarity metrics. International Journal of Computational Linguistics, 9(2), 45–56.
Dussubieux, L., Gratuze, B., & Blet-Lemarquand, M. (2010). Mineral soda alumina glass. Journal of Archaeological Science, 37(7), 1645–1655.
Glover, I., & Kenoyer, J. M. (2019). Overlooked imports: Carnelian beads in the Korean Peninsula. Asian Perspectives, 58(1), 180–201.
Guy, J. (2001). The Galle trilingual inscription. Journal of the Royal Asiatic Society, 11(3), 1–21.
Hae-Young, W. (2021). Along the sea turtle trail. Journal of East-West Comparative Literature, 57, 199–224.
Han, J.-S. (2016). Foreigners and their social integration in Yuan China: The case of Quanzhou. Journal of Asian History, 50(1), 45–70.
Hockett, C. F. (1963). The problem of universals in language. In J. Greenberg (Ed.), Universals of language. MIT Press.
Hulbert, H. B. (1905). A comparative grammar of the Korean language and the Dravidian languages of India. Methodist Publishing House.
Jaro, M. A. (1989). Advances in record-linkage methodology. Journal of the American Statistical Association, 84(406), 414–420.
Kang, G. U. (1990). ???? ?????? ?? [A comparative linguistic study of ancient history]. Saemunsa.
Keraf, G. (1991). Linguistik bandingan historis. Gramedia.
Kim, K. (1999). A new proposal for a standard Hangul code. Computer Standards & Interfaces, 20, 243–257.
Kim-Renaud, Y.-K. (1997). The Korean alphabet. University of Hawai‘i Press.
Kokarneswaran, M., Selvaraj, P., Ashokan, T., Perumal, S., Sellappan, P., Murugan, K. D., ... & Chandrasekaran, V. (2020). Discovery of carbon nanotubes in sixth century BC potteries from Keeladi, India. Scientific reports, 10(1), 19786.
Lee, K. H., & Yi, K. (2017). Kory?’s trade with the outer world. Korean Studies, 41, 52–74.
Lee, K. R. (2017). A study on the cultural contacts between Garak Kingdom and ancient South India: With special reference to fish worship. Journal of Indian Studies, 22(1), 85–121. https://doi.org/10.21758/jis.2017.22.1.85
Li, H., & Dunn, J. (2022). Corpus similarity measures remain robust. Lingua, 275, 103377.
Liu, D., & Tang, X. (2024). Comparative linguistic analysis with Firthian collocations: Cases of synonym differentiation and proficiency assessment. Lingua, 306, 103755. https://doi.org/10.1016/j.lingua.2024.103755
Liyanarachchi, G. (2013). The Periplus of the Erythraean Sea. Accounting History, 18(2), 277–279.
Mahadevan, I. (2003). Early Tamil epigraphy. Harvard University Press.
Ngoc, L. T., et al. (2018). Named entity translation using Levenshtein distance. In Proceedings of the international conference on language resources.
Ohnmar, K., et al. (2013). Cross-lingual phonetic similarity for loanword identification. In Proceedings of IJCNLP (pp. 1261–1268).
Pakhomov, S. V., & Hemmy, L. S. (2014). A computational linguistic measure of clustering behavior on semantic verbal fluency task predicts risk of future dementia in the Nun Study. Cortex, 55, 97-106. https://doi.org/10.1016/j.cortex.2013.05.009
Pathmanathan, R., Pearce, J., Kjeldskov, J., & Smith, W. (2011). Using mobile phones for promoting water conservation. In Proceedings of the 23rd Australian Computer-Human Interaction Conference (pp. 243-252).
Reshma, V. M., & Mathew, L. S. (2015). Longest common subsequence method. IOSR Journal of Computer Engineering, 17(6), 1–7.
Rinjeni, T. P., Indriawan, A., & Rakhmawati, N. A. (2024). Matching scientific article titles using Cosine Similarity and Jaccard Similarity algorithm. Procedia Computer Science, 234, 553-560. https://doi.org/10.1016/j.procs.2024.03.039
Santos, R., Murrieta-Flores, P., & Martins, B. (2017). Combining string similarity metrics. International Journal of Digital Earth, 11(9), 913–938.
Schottenhammer, A. (2008). The East Asian Mediterranean. Harrassowitz.
Sen, T. (2006a). The Yuan dynasty and the Indian Ocean. Journal of the Economic and Social History of the Orient, 49(3), 415–445.
Sen, T. (2006b). Buddhism, diplomacy, and trade. University of Hawai‘i Press.
Sidebotham, S. E. (2011). Berenike and the ancient maritime spice route. University of California Press.
Sivanantham, R., & Seran, M. (2019). Keeladi. Government of Tamil Nadu.
Steever, S. B. (Ed.). (2019). The Dravidian languages (2nd ed.). Routledge.
Suresh Kumar, D. (2025). Keeladi excavation report controversy. The Hindu.
Swadesh, M. (1952). Lexico-statistic dating of prehistoric ethnic contacts: with special reference to North American Indians and Eskimos. Proceedings of the American philosophical society, 96(4), 452-463.
Tanaka, S. (2015). Consonantal stability in historical phonology. Journal of Historical Linguistics.
Tanaka-Ishii, K. (2015). Consonants as skeleton of language. In Language production. Springer.
Thanabalasingam, U. (2023). A phonetic comparison of Korean and Tamil. Open Journal of Modern Linguistics, 13, 711–733.
The Academy of Korean Studies. (2017). A history of Korea.
Verma, S. P. (2005a). Trade and cultural contacts in the Indian Ocean world. Manohar.
Verma, V. K. (2005b). Maritime trade between early historic Tamil Nadu. Proceedings of the Indian History Congress, 66, 125–134.
Yi, J., et al. (2022). Compositional analysis of early glass beads. Journal of Archaeological Science: Reports, 41, 103293.
Zhang, H. (2014a). Stroke order in Hanzi handwriting. Cercles, 24(1), 67–85.
Zhang, Q. (2014b). Stroke structure analysis. Writing Systems Research.
Zhao, C., & Sahni, S. (2020). String correction algorithm using Damerau-Levenshtein distance. BMC Bioinformatics, 21, 14.
Published
How to Cite
Issue
Section
Copyright (c) 2026 Linguistics and Culture Review

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.



