Twitter Data Analysis

Summary

  • Balanced mismatched producer and consumer throughput by implementing a high-performance parallel data-fetching system on RabbitMQ
  • Performed sentiment analysis with NLP and various machine learning models
  • Built a web UI that communicates with backend RESTful APIs to visualize data
  • Designed a MySQL database schema to store and retrieve analytic results
  • Wrote unit tests with JUnit to verify functionality
  • Developed a scalable pipeline using Java and Play Framework
  • Currently scaling the project using Hadoop
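
The producer/consumer imbalance mentioned in the first bullet can be illustrated with a minimal sketch. This is not the project's code: it swaps RabbitMQ for an in-process bounded BlockingQueue so it runs with the JDK alone, and the class name FetchPipelineSketch, the "tweet-N" ids, and the stop marker are hypothetical. It assumes one producer feeding several slower consumers; the summary does not say which side was the bottleneck.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch: a bounded in-memory queue stands in for a RabbitMQ queue.
public class FetchPipelineSketch {
    // Marker telling a consumer to stop (one is enqueued per consumer).
    private static final String STOP = "__STOP__";

    // Enqueues `items` work items, processes them on `consumers` threads,
    // and returns how many items were processed.
    public static int run(int items, int consumers) {
        // The bounded capacity makes put() block when consumers fall behind,
        // which throttles the producer instead of letting the backlog grow.
        BlockingQueue<String> queue = new ArrayBlockingQueue<>(16);
        AtomicInteger processed = new AtomicInteger();
        ExecutorService pool = Executors.newFixedThreadPool(consumers);

        for (int c = 0; c < consumers; c++) {
            pool.submit(() -> {
                try {
                    while (true) {
                        String id = queue.take();
                        if (id.equals(STOP)) {
                            return;
                        }
                        // A real consumer would fetch and store data here.
                        processed.incrementAndGet();
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }

        try {
            for (int i = 0; i < items; i++) {
                queue.put("tweet-" + i);
            }
            for (int c = 0; c < consumers; c++) {
                queue.put(STOP);
            }
            pool.shutdown();
            pool.awaitTermination(10, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while running sketch", e);
        }
        return processed.get();
    }

    public static void main(String[] args) {
        System.out.println(run(100, 4));
    }
}
```

Adding consumers raises throughput on the slow side, while the bounded queue applies backpressure to the fast side. With a message broker, a comparable effect is usually obtained by running more consumer processes and limiting how many unacknowledged messages each one holds.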
Guoxin Sun
Advisory Engineer

Data Scientist empowering data-driven business leadership.