- [x] Properly handle Named Entity Recognition - [x] Handle Punctuations - [x] Look for suitable Machine Algo to train -PCA, SVM, Artificial Neural Networks, Multiple Linear Regression - [x] Split into training and test set and train on all 8 sets of essays and calculate accuracy. Plot graphs. (Kappa Values) - [x] Improve Spelling Accuracy - [ ] Add more features in dataframe - No. of Stop Words in Sentence - Average length of Sentence in an essay - [ ] Improve code modularity (if possible, least preference) - [x] Normalize wrong spellings, word count (on Scale of 1-100) - [ ] Optimize code (Time Complexity and Space complexity) - [ ] If time permits, Use Word2Vec, TextRank to understand sentence Structure. ( Less preference)
- No. of Stop Words in Sentence
- Average length of Sentence in an essay