Homonym extraction in natural language processing

Homonym Extraction in Natural Language Processing: Key Methods and Approaches

Importance of Homonym Extraction and Word Sense Disambiguation

Homonym extraction is an important task in natural language processing (NLP) because homonyms, words that share a spelling or pronunciation but differ in meaning, create ambiguity in text understanding and machine translation. Identifying and distinguishing them accurately underpins word sense disambiguation, semantic analysis, and translation accuracy (Elov et al., 2023; Elov et al., 2022; Abdullah et al., 2023; +1 more).

Machine Learning Approaches for Homonym Extraction

Machine learning methods, especially the Naive Bayes classifier, are commonly used for homonym extraction because they are simple and fast. For Uzbek, the Naive Bayes classifier has been reported to distinguish homonyms among grammatically similar word groups, which makes it a practical choice for multi-class classification tasks in NLP. Depending on the data, different Naive Bayes variants (Gaussian, Multinomial, Bernoulli) can be applied to improve performance.
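As a minimal sketch of how a Naive Bayes classifier can pick the sense of a homonym from its context words, the following implements a multinomial variant with add-one smoothing using only the standard library. The training data is a hypothetical English toy set for the word "bank", not data from the cited Uzbek studies, and the function names `train` and `predict` are illustrative assumptions.

```python
import math
from collections import Counter, defaultdict


def train(examples):
    """Count senses and context words. examples: list of (tokens, sense)."""
    sense_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, sense in examples:
        sense_counts[sense] += 1
        word_counts[sense].update(tokens)
        vocab.update(tokens)
    return sense_counts, word_counts, vocab


def predict(model, tokens):
    """Return the sense with the highest log posterior for the context."""
    sense_counts, word_counts, vocab = model
    total = sum(sense_counts.values())
    best_sense, best_score = None, -math.inf
    for sense, n in sense_counts.items():
        score = math.log(n / total)  # log prior
        denom = sum(word_counts[sense].values()) + len(vocab)
        for tok in tokens:
            if tok in vocab:  # words never seen in training are ignored
                # add-one (Laplace) smoothed log likelihood
                score += math.log((word_counts[sense][tok] + 1) / denom)
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense


# Hypothetical toy contexts for the homonym "bank" (assumption, not source data).
TRAIN = [
    (["deposit", "money", "account"], "finance"),
    (["loan", "interest", "money"], "finance"),
    (["river", "water", "fishing"], "river"),
    (["muddy", "river", "grass"], "river"),
]

model = train(TRAIN)
print(predict(model, ["money", "loan"]))
print(predict(model, ["water", "grass"]))
```

The multinomial variant fits word-count features like these; Gaussian or Bernoulli variants would be the natural swap for continuous or binary features respectively.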

Linguistic and Mathematical Modeling of Homonyms

Semantic analysis often groups homonyms by the parts of speech in which they occur. In Uzbek, for example, homonyms are categorized into groups such as adjective–noun–adverb or noun–pronoun–verb, and mathematical models are developed to differentiate these groups; one study in the source list reports seven such models built on 11 linguistic factors. This structured approach supports systematic identification of homonyms in their linguistic context.
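The grouping step described above can be sketched roughly as follows, assuming a corpus that is already POS-tagged. The tag names follow a UPOS-like convention and the English word list is hypothetical; the mathematical models from the Uzbek work are not reproduced here, only the idea of bucketing candidate homonyms by the set of parts of speech they appear in.

```python
from collections import defaultdict


def group_by_pos_set(tagged_tokens):
    """Group words seen with more than one POS tag by their tag set."""
    pos_by_word = defaultdict(set)
    for word, pos in tagged_tokens:
        pos_by_word[word.lower()].add(pos)

    groups = defaultdict(list)
    for word, tags in pos_by_word.items():
        if len(tags) > 1:  # only cross-POS candidates are kept
            groups[tuple(sorted(tags))].append(word)
    return {key: sorted(words) for key, words in groups.items()}


# Hypothetical tagged tokens (assumption, for illustration only).
TAGGED = [
    ("light", "NOUN"), ("light", "ADJ"), ("light", "VERB"),
    ("fast", "ADJ"), ("fast", "ADV"), ("fast", "NOUN"),
    ("well", "ADV"), ("well", "NOUN"), ("well", "ADJ"),
    ("river", "NOUN"),
]

groups = group_by_pos_set(TAGGED)
for tag_set, words in groups.items():
    print("-".join(tag_set), words)
```

This only surfaces candidates whose readings differ in part of speech; homonyms that share a single part of speech would need a sense-level signal instead.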

Homonym and Polysemy Feature Extraction in Machine Translation

Homonym and polysemy extraction is particularly important in machine translation, where ambiguous words can lead to translation errors. Recent research on Indonesian-English machine translation uses part-of-speech (POS) tagging, word similarity measures (such as Word2vec and BERT embeddings), and synonym-based term expansion to extract homonym and polysemy features. These features are compiled into dictionaries and used to improve translation accuracy by updating terms based on semantic similarity (Abdullah et al., 2023; Harjo et al., 2024). Morphology extraction, including the detection of prefixes, lemmas, and suffixes, further helps identify homonyms in morphologically rich languages.
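To illustrate the similarity-based term update, here is a small sketch that picks the dictionary sense whose vector is closest (by cosine similarity) to a context vector. The three-dimensional vectors are hand-made placeholders standing in for Word2vec or BERT embeddings, and the dictionary layout and the name `pick_translation` are assumptions rather than the cited authors' design. The example word is Indonesian "bisa", which can mean "can" or "venom".

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def pick_translation(context_vec, sense_entries):
    """Return the translation of the sense most similar to the context."""
    best = max(sense_entries, key=lambda e: cosine(context_vec, e["vector"]))
    return best["translation"]


# Hypothetical homonym dictionary entry for "bisa" (toy vectors, assumption).
BISA_SENSES = [
    {"translation": "can", "vector": [0.9, 0.1, 0.0]},
    {"translation": "venom", "vector": [0.1, 0.2, 0.9]},
]

snake_context = [0.0, 0.3, 0.8]    # stands in for a sentence about snakes
ability_context = [0.8, 0.2, 0.1]  # stands in for a sentence about ability

print(pick_translation(snake_context, BISA_SENSES))
print(pick_translation(ability_context, BISA_SENSES))
```

In a real pipeline the context vector would come from an embedding model, and POS tags could narrow the candidate senses before the similarity comparison.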

Evaluation and Impact on Translation Accuracy

Integrating homonym and polysemy extraction into neural machine translation systems has been reported to improve translation quality. Systems that incorporate these features show higher precision, recall, F1 measure, and overall accuracy than baseline models, which supports the value of targeted homonym extraction in practical NLP applications.
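For reference, the four metrics named above can be computed from gold and predicted labels as in the sketch below. The labels are hypothetical and the numbers say nothing about the cited systems; treating one sense as the "positive" class is an assumption made to keep the example small.

```python
def evaluate(gold, predicted, positive):
    """Precision, recall, and F1 for one class, plus overall accuracy."""
    pairs = list(zip(gold, predicted))
    tp = sum(g == positive and p == positive for g, p in pairs)
    fp = sum(g != positive and p == positive for g, p in pairs)
    fn = sum(g == positive and p != positive for g, p in pairs)
    correct = sum(g == p for g, p in pairs)

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    accuracy = correct / len(pairs) if pairs else 0.0
    return {"precision": precision, "recall": recall,
            "f1": f1, "accuracy": accuracy}


# Hypothetical sense labels for five occurrences of one homonym.
gold = ["venom", "can", "can", "venom", "can"]
predicted = ["venom", "can", "venom", "venom", "can"]

scores = evaluate(gold, predicted, positive="venom")
print(scores)
```

Comparing these scores for a baseline and for a system with homonym features, on the same test set, is the kind of comparison the paragraph above describes.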

Conclusion

Homonym extraction is a foundational task in NLP that supports accurate semantic analysis and machine translation. Machine learning classifiers such as Naive Bayes, linguistic modeling, and feature extraction techniques (including morphology and semantic similarity) are effective strategies for identifying and handling homonyms. Together, these approaches improve NLP systems by reducing ambiguity and raising the accuracy of language understanding and translation (Elov et al., 2023; Elov et al., 2022; Abdullah et al., 2023; +1 more).

Sources and full results

Most relevant research papers on this topic

SO‘Z MA’NOSINI ANIQLASHDA NAIVE BAYES ALGORITMIDAN FOYDALANISH (Using the Naive Bayes Algorithm to Determine Word Meaning)

The Naive Bayes classifier is a simple and fast method for eliminating homonymy between grammatically similar groups of words in the Uzbek language.

2023 · 0 citations · B. Elov et al. · Journal of Science and Innovative Development

Hypernym extraction from Wikipedia and Wiktionary

This paper demonstrates the extraction of Hypernym-Hyponym relations from Wikipedia and Wiktionary using Finite State Machines, a powerful tool for natural language processing.

2017 · 2 citations · Emre Sasmaz et al. · 2017 25th Signal Processing and Communications Applications Conference (SIU)

Learning Syntactic Patterns for Automatic Hypernym Discovery

Our algorithm automatically learns hypernym relations from text, achieving higher precision and recall than WordNet, making it a valuable tool for natural language processing applications.

2004 · 836 citations · R. Snow et al.

MODELING OF BUSINESS PROCESSES THAT DISTINGUISH HOMONYMY WITHIN THREE PARTS OF SPEECH

This study develops 7 mathematical models to differentiate homonyms within three parts of speech in the Uzbek language, using 11 linguistic factors.

2022 · 0 citations · Botir Elov et al. · Journal of Science and Innovative Development

Homonym and Polysemy Approaches in Term Weighting for Indonesian-English Machine Translation

This research proposes a method to extract homonyms and polysemy in Indonesian, improving Indonesian-English Machine Translation accuracy by combining word similarity and semantic similarity.

2023 · 5 citations · Rachmad Abdullah et al. · 2023 14th International Conference on Information & Communication Technology and System (ICTS)

Homonym and polysemy approaches with morphology extraction in weighting terms for Indonesian to English machine translation

This research proposes homonym and polysemy approaches with morphology extraction in weighting terms for Indonesian to English machine translation, aiming to capture homonym and polysemy features and improve translation accuracy.

2024 · 0 citations · Budi Harjo et al. · International Journal of Electrical and Computer Engineering (IJECE)

A Comparative Study on Keyword Extraction and Generation of Synonyms in Natural Language Processing

The extreme learning machine (ELM) model outperforms the rule-based and statistical models in keyword extraction and synonym generation for natural language processing.

2023 · 1 citation · Rasmi Rani Dhala et al. · 2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT)

Investigating Natural Language Techniques for Accurate Noun and Verb Extraction

spaCy and POS tagging achieve high accuracy in extracting nouns and verbs from text, with potential applications across diverse language processing tasks and industries.

2024 · 13 citations · Reshma P. Nair et al.

Keyword extraction method for machine reading comprehension based on natural language processing

The proposed keyword extraction method using natural language processing technology improves machine reading comprehension accuracy compared to traditional methods.

2021 · 4 citations · Ruiheng Li et al. · Journal of Physics: Conference Series

Extracting cancer concepts from clinical notes using natural language processing: a systematic review

NLP algorithms, particularly rule-based algorithms, are highly accurate and sensitive in extracting cancer concepts from clinical notes, suggesting their potential use in other diseases as well.

Systematic review · 2023 · 45 citations · M. Gholipour et al. · BMC Bioinformatics
