Home Page
About Us


The Natural Language Toolkit is a Python library for computational linguistics.


Scikit-learn is a machine-learning library for Python that provides simple and efficient tools for data analysis and data mining, with a focus on machine learning



The nltk library includes a confusion matrix that is simple to use and produces a nicer output than scikit-learn

from question  

Python tabulating confusion matrix

What you re looking for is linear regression and scikit-learn is much better than nltk for this see

from question  

NLTK: Document Classification with numeric score instead of labels

Or scikit-learn directly .for more details nltk 3.0 documentation

from question  

NLTK SVM Classifier Terminates

If you are worried about memory then do look into scikit-learn since equivalent models can use significantly less memory than nltk

from question  

NLTK on a production web application

Back to Home
Data comes from Stack Exchange with CC-BY-SA-4.0