Advances in Multinomial Naive Bayes for Text Classification
Introduction to Multinomial Naive Bayes (MNB)
Multinomial Naive Bayes (MNB) is a popular algorithm for text classification because of its simplicity, efficiency, and interpretability. It models each document as a bag of word counts drawn from a class-conditional multinomial distribution and assumes that features (words) are conditionally independent given the class label, an assumption that rarely holds in real-world text. This article surveys advancements and modifications to traditional MNB that address this limitation and improve its performance.
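To ground the discussion, here is a minimal baseline using scikit-learn's MultinomialNB, a standard implementation of the classifier described above; the tiny spam/ham corpus is purely illustrative.

```python
# Baseline multinomial naive Bayes on a toy spam/ham corpus.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

docs = ["cheap pills buy now", "meeting agenda attached",
        "win a free prize now", "quarterly report attached"]
labels = ["spam", "ham", "spam", "ham"]

# CountVectorizer produces the word-count features the multinomial model
# expects; alpha=1.0 is standard Laplace smoothing.
model = make_pipeline(CountVectorizer(), MultinomialNB(alpha=1.0))
model.fit(docs, labels)
print(model.predict(["free pills now"]))  # -> ['spam']
```

Each extension below modifies some part of this pipeline: the per-word conditional estimates, the independence structure, or the weight each word carries.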
Structure Extended Multinomial Naive Bayes (SEMNB)
One significant advancement is the Structure Extended Multinomial Naive Bayes (SEMNB), which weakens the attribute independence assumption by incorporating dependencies among features. Rather than learning an explicit network structure, SEMNB averages weighted one-dependence multinomial estimators, where each estimator conditions every word on the class and on a single "parent" word, thereby avoiding the complex structure learning typical of Bayesian networks. Experimental results have shown that SEMNB significantly outperforms traditional MNB and other state-of-the-art algorithms. A simplified version of the averaging scheme is sketched below.
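The sketch below scores a document by averaging one one-dependence multinomial estimate per word present in it. The function name, smoothing, and uniform (unweighted) averaging are assumptions for brevity; SEMNB itself uses learned weights.

```python
import numpy as np

def semnb_predict(X, y, x_new, alpha=1.0):
    """Illustrative sketch of averaged one-dependence multinomial estimators
    (uniform weights; SEMNB learns the weights).
    X: doc-term count matrix, y: labels, x_new: count vector to classify."""
    X, y, x_new = map(np.asarray, (X, y, x_new))
    classes = np.unique(y)
    parents = np.nonzero(x_new)[0]          # each present word serves as a parent
    scores = np.zeros(len(classes))
    for ci, c in enumerate(classes):
        for p in parents:
            sub = X[(y == c) & (X[:, p] > 0)]   # class-c docs containing parent p
            # Smoothed estimates of P(c, w_p present) and P(w | c, w_p present)
            prior = (len(sub) + alpha) / (len(X) + alpha * len(classes))
            theta = (sub.sum(axis=0) + alpha) / (sub.sum() + alpha * X.shape[1])
            scores[ci] += np.exp(np.log(prior) + x_new @ np.log(theta))
    return classes[np.argmax(scores)]
```

For long documents the exponentiation can underflow; a production version would combine the estimators in log space, e.g. via log-sum-exp.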
Multinomial Naive Bayes Tree (MNBTree)
Inspired by the Naive Bayes Tree (NBTree), the Multinomial Naive Bayes Tree (MNBTree) places a multinomial naive Bayes classifier at each leaf of a decision tree, so the tree partitions the documents and a local MNB model classifies within each partition, leveraging the strengths of both methods. MNBTree uses information gain for tree construction, which keeps computational cost low, and its multiclass extension, MMNBTree, scales the approach to problems with many classes. A minimal version of the hybrid appears below.
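This hybrid can be assembled from scikit-learn parts: a shallow tree trained with the entropy criterion (information gain) partitions the training documents, and a separate MultinomialNB is fit at each leaf. The class name and depth limit are illustrative choices, not the paper's implementation.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.naive_bayes import MultinomialNB

class MNBTreeSketch:
    """Decision tree with a multinomial naive Bayes model at each leaf."""
    def __init__(self, max_depth=3):
        # criterion="entropy" splits on information gain, as MNBTree does
        self.tree = DecisionTreeClassifier(criterion="entropy", max_depth=max_depth)
        self.leaf_models = {}

    def fit(self, X, y):
        y = np.asarray(y)
        self.tree.fit(X, y)
        leaves = self.tree.apply(X)              # leaf index of each training doc
        for leaf in np.unique(leaves):
            idx = leaves == leaf
            if len(np.unique(y[idx])) > 1:
                self.leaf_models[leaf] = MultinomialNB().fit(X[idx], y[idx])
            else:
                self.leaf_models[leaf] = y[idx][0]  # pure leaf: store the class
        return self

    def predict(self, X):
        preds = []
        for i, leaf in enumerate(self.tree.apply(X)):
            m = self.leaf_models[leaf]
            preds.append(m.predict(X[i:i + 1])[0] if hasattr(m, "predict") else m)
        return np.array(preds)
```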
Mixture of Latent Multinomial Naive Bayes (MLMNB)
The Mixture of Latent Multinomial Naive Bayes (MLMNB) introduces a latent variable that mediates dependencies among attributes, relaxing the independence assumption without the complexity of structural learning; when the independence assumption actually holds, the model reduces to the plain multinomial naive Bayes classifier. MLMNB has demonstrated substantial improvements in classification accuracy, conditional log-likelihood, and area under the ROC curve across various datasets.
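The heart of such a model is expectation-maximization over the latent variable. The sketch below fits a mixture of multinomials, the building block one would fit per class; it is a generic EM sketch, not MLMNB's exact parameterization.

```python
import numpy as np

def fit_multinomial_mixture(X, n_components=2, n_iter=50, alpha=1.0, seed=0):
    """EM for a mixture of multinomials: P(x) = sum_z P(z) * prod_w theta[z, w]^x_w.
    X: doc-term count matrix. Returns mixing weights P(z) and word
    distributions theta[z, w] = P(w | z)."""
    rng = np.random.default_rng(seed)
    n_docs, n_words = X.shape
    pi = np.full(n_components, 1.0 / n_components)
    theta = rng.dirichlet(np.ones(n_words), size=n_components)
    for _ in range(n_iter):
        # E-step: responsibilities r[d, z] proportional to P(z) * prod_w theta[z, w]^X[d, w]
        log_r = np.log(pi) + X @ np.log(theta).T
        log_r -= log_r.max(axis=1, keepdims=True)   # stabilize before exponentiating
        r = np.exp(log_r)
        r /= r.sum(axis=1, keepdims=True)
        # M-step: re-estimate mixing weights and smoothed word distributions
        pi = r.mean(axis=0)
        counts = r.T @ X + alpha
        theta = counts / counts.sum(axis=1, keepdims=True)
    return pi, theta
```

With n_components=1 the mixture collapses to a single multinomial, mirroring how MLMNB reduces to plain naive Bayes when no latent structure is needed.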
Improved Distance Correlation Coefficient Attribute Weighted MNB (IDCWMNB)
The Improved Distance Correlation Coefficient Attribute Weighted Multinomial Naive Bayes (IDCWMNB) enhances attribute weighting by using an improved distance correlation coefficient, a statistic that measures the joint correlation of random vectors. It incorporates document frequency and term weighting to assign each term its own weight in the classifier. IDCWMNB effectively balances classification accuracy and execution time, outperforming weighting schemes based on traditional statistical measures.
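The sketch below uses the plain (not "improved") distance correlation between each term's counts and the class labels as that term's weight, then scales each word's log-likelihood contribution by its weight; the document-frequency and term-weighting refinements of IDCWMNB are omitted.

```python
import numpy as np

def distance_correlation(a, b):
    """Distance correlation between two 1-D samples (O(n^2) memory,
    so suitable only for modest training-set sizes)."""
    def centered(v):
        d = np.abs(v[:, None] - v[None, :])               # pairwise distances
        return d - d.mean(axis=0) - d.mean(axis=1)[:, None] + d.mean()
    A, B = centered(a.astype(float)), centered(b.astype(float))
    dcov2 = (A * B).mean()
    denom = np.sqrt((A * A).mean() * (B * B).mean())
    return np.sqrt(max(dcov2, 0.0) / denom) if denom > 0 else 0.0

def term_weights(X, y):
    """One weight per term: its distance correlation with the class labels."""
    _, y_codes = np.unique(y, return_inverse=True)        # encode labels as integers
    return np.array([distance_correlation(X[:, j], y_codes)
                     for j in range(X.shape[1])])

def weighted_mnb_predict(X, y, w, x_new, alpha=1.0):
    """Attribute-weighted MNB: word j contributes w[j] * x_j * log P(w_j | c)."""
    X, y, x_new = map(np.asarray, (X, y, x_new))
    classes = np.unique(y)
    scores = []
    for c in classes:
        Xc = X[y == c]
        prior = (len(Xc) + alpha) / (len(X) + alpha * len(classes))
        theta = (Xc.sum(axis=0) + alpha) / (Xc.sum() + alpha * X.shape[1])
        scores.append(np.log(prior) + np.sum(w * x_new * np.log(theta)))
    return classes[np.argmax(scores)]
```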
Hidden Multinomial Naive Bayes (HMNB)
Hidden Multinomial Naive Bayes (HMNB) adapts the hidden naive Bayes model to text: it creates a hidden parent for each feature that synthesizes the influences of all other features, capturing dependencies without explicit structure learning and its computational cost. The same idea can be extended to improve related models such as complement naive Bayes (CNB) and one-versus-all-but-one (OVA). Extensive experiments validate the effectiveness of HMNB and its variants.
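A crude approximation of the hidden-parent idea: for each word in the test document, average its conditionals given each other present word, and use that averaged conditional in place of the independent one. The uniform averaging here is an assumption; HMNB learns how much each feature contributes to the hidden parent.

```python
import numpy as np

def hmnb_predict(X, y, x_new, alpha=1.0):
    """Illustrative hidden-parent sketch: P(w_i | hp_i, c) is approximated by
    the uniform average of P(w_i | w_j present, c) over other present words j.
    Pairwise statistics make this O(V^2) in the worst case, so it is meant
    for small vocabularies only."""
    X, y, x_new = map(np.asarray, (X, y, x_new))
    classes = np.unique(y)
    V = X.shape[1]
    words = np.nonzero(x_new)[0]
    scores = np.zeros(len(classes))
    for ci, c in enumerate(classes):
        Xc = X[y == c]
        ll = np.log((len(Xc) + alpha) / (len(X) + alpha * len(classes)))
        for i in words:
            conds = []
            for j in words:
                if j == i:
                    continue
                sub = Xc[Xc[:, j] > 0]                  # class-c docs containing w_j
                conds.append((sub[:, i].sum() + alpha) / (sub.sum() + alpha * V))
            if conds:
                ll += x_new[i] * np.log(np.mean(conds))
        scores[ci] = ll
    return classes[np.argmax(scores)]
```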
Self-Structure Extended Multinomial Naive Bayes
The Self-Structure Extended Multinomial Naive Bayes combines the simplicity of MNB with the dependency modeling of SEMNB. This hybrid model aims to correct misclassifications made by SEMNB when attributes lack direct dependencies, further enhancing classification accuracy; see the sketch below.
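Under the interpretation that "self-structure" means including the plain MNB estimate as one more member of SEMNB's ensemble (an assumption on our part, not confirmed by the source), a sketch looks like this:

```python
import numpy as np

def self_semnb_predict(X, y, x_new, alpha=1.0):
    """Hypothetical self-structure sketch: average the plain MNB estimate
    (the 'self' structure) together with one-dependence estimates, so
    documents whose words lack informative parents still score sensibly."""
    X, y, x_new = map(np.asarray, (X, y, x_new))
    classes = np.unique(y)
    V = X.shape[1]
    parents = np.nonzero(x_new)[0]
    scores = np.zeros(len(classes))
    for ci, c in enumerate(classes):
        Xc = X[y == c]
        prior = (len(Xc) + alpha) / (len(X) + alpha * len(classes))
        # 'Self' estimator: plain multinomial naive Bayes for class c
        theta = (Xc.sum(axis=0) + alpha) / (Xc.sum() + alpha * V)
        estimates = [np.exp(np.log(prior) + x_new @ np.log(theta))]
        # One-dependence estimators, one per word present in x_new
        for p in parents:
            sub = Xc[Xc[:, p] > 0]
            th = (sub.sum(axis=0) + alpha) / (sub.sum() + alpha * V)
            estimates.append(np.exp(np.log(prior) + x_new @ np.log(th)))
        scores[ci] = np.mean(estimates)
    return classes[np.argmax(scores)]
```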
Conclusion
The advancements in Multinomial Naive Bayes, such as SEMNB, MNBTree, MLMNB, IDCWMNB, HMNB, and self-structure extended models, address the limitations of the traditional MNB by incorporating feature dependencies and improving attribute weighting. These modifications have demonstrated significant improvements in classification performance across various text datasets, making MNB a more robust and versatile tool for text classification tasks.