NLP: Qualitatively “positive” vs “negative” sentence?

What you are looking for is commonly dubbed Sentiment Analysis Typically, sentiment analysis is not able to handle delicate subtleties, like sarcasm or irony, but it fares pretty well if you throw a large set of data at it.

What you are looking for is commonly dubbed Sentiment Analysis. Typically, sentiment analysis is not able to handle delicate subtleties, like sarcasm or irony, but it fares pretty well if you throw a large set of data at it. Sentiment analysis usually needs quite a bit of pre-processing.At least tokenization, sentence boundary detection and part-of-speech tagging.

Sometimes, syntactic parsing can be important. Doing it properly is an entire branch of research in computational linguistics, and I wouldn't advise you with coming up with your own solution unless you take your time to study the field first. OpenNLP has some tools to aid sentiment analysis, but if you want something more serious, you should look into the LingPipe toolkit.

It has some built-in SA-functionality and a nice tutorial. And you can train it on your own set of data, but don't think that it is entirely trivial :-). Googling for the term will probably also give you some resources to work with.

If you have any more specific question, just ask, I'm watching the nlp-tag closely ;-).

Amazingly useful - thanks a bunch, Aleksandar! – Alex Weinstein Sep 24 '08 at 16:38.

Alex, Some approaches to sentiment analysis use strategies popular on other text classification tasks. The most common being transforming your film review into a word vector, and feeding it into a classifier algorithm as training data. Most popular data mining packages can help you here.

You could have a look at this tutorial on sentiment classification illustrating how to do an experiment using the open source RapidMiner toolkit. Incidentally, there is a good data set made available for research pusposes related to detecting opinion on film reviews. It is based on IMDB user reviews, and you can check many related research work on the area and how they use the data set.

Its worth bearing in mind that the effectiveness of these methods can only be judged from a statistical viewpoint, so you can pretty much assume there will be misclassifications and cases where opinion is hard to detect. As already noticed in this thread, detecting things like irony and sarcasm can be very difficult indeed. B.

Try stemming with WordNet; you may have to augment the vocabulary with positive/negative weights to get what you want though.

A Naïve Bayesian Classifier would do the job, and it's very simple to implement.

Humans only agree with each other 70% of the time even in polarity sentiment analysis. en.wikipedia.org/wiki/Sentiment_analysis Jeff's point, while not very encouraging, does accurately reflect how extremely difficult is to get anywhere with this.

I cant really gove you an answer,but what I can give you is a way to a solution, that is you have to find the anglde that you relate to or peaks your interest. A good paper is one that people get drawn into because it reaches them ln some way.As for me WW11 to me, I think of the holocaust and the effect it had on the survivors, their families and those who stood by and did nothing until it was too late.

Related Questions