Today Svetlin Nakov defended successfully his PhD thesis titled “Automatic Extraction of False Friends from Parallel Bilingual Corpus” and was awarded with the scientific and educational degree “Doctor of Philosopy” (PhD) in Informatics in the area of computational linguistics. The thesis was defended according to the Bulgarian law, in front of the Specialized Scientific Council
Today I granted to the community (under MIT license) the source code of the most interesting algorithms designed for my PhD thesis (implemented in C#): MMEDR – algorithm for measuring weighted orthographic similarity between Bulgarian and Russian words taking into account some linguistically motivated Bulgarian-Russian correspondences (current supports Bulgarian and Russian only) SemSim – algorithm
Today I presented at the prestigious scientific conference RANLP’2009 a research paper about new methods of extraction of false friends from parallel corpora, which is a major part of my PhD thesis. The article is named “Unsupervised Extraction of False Friends from Parallel Bi-Texts Using the Web as a Corpus” and was accepted after passing