I did Sentiment Analysis on Reddit(Yes, that includes r/Stocks!)


(A repost/update from here: https://www.reddit.com/r/stocks/comments/wrlxiy/i_did_sentiment_analysis_on_reddityes_that/)

Sentiment analysis is the quantifying of how positive, negative, or neutral a given text is. I've recently been applying this kind of analysis to various financial subreddits as a source of alternative data in an effort to create a trading algorithm. Finding if a strong correlation exists between sentiment and market movement is the first step. I thought it could be beneficial to everyone share the results here. I've organized the data in this website: https://sentimentlytics.herokuapp.com/

Any sort of discussion is welcome, but I'm specifically interested in these areas of discussion:

  • Has anyone else applied sentiment analytics to reddit and tried to find correlation between that and the markets? Were your results similar?
  • Is there a better open source model than VADER for this kind of text corpus?
  • My background is in Data, so my UI skills are lacking. Let me know what I can improve.
  • At the moment I've only included a bunch of the most common stocks/etfs I've seen. Additional recommendations for tickers that might be of interest would be appreciated.

Results TDLR: Long term(3 Months+), correlation between market and sentiment outputs are low. Short term there might be space to use specific search terms as markers for features.

If you want more information in regards to my process here it is: https://sentimentlytics.herokuapp.com/information

I applied the Industry Discussion flair, but I'm not entirely certain if that's the appropriate one. Let me know if it isn't please.


Comments

Leave a Reply

Your email address will not be published. Required fields are marked *