Lenovo, Kaeshev and Leo, Day 4

It turns out 20,000 review is a lot, and it’s not all smooth sailing. We began running into difficulty in separating data into meaningful categories (especially since 1/3 of people didn’t write more than half a sentence), and our code began to have some issues. Besides looking for solutions, we did some more detailed analysis on the smaller sample sizes, and gave our report.

Hashmap diagram (what we are trying to get to work):

Lenovo, Kaeshev and Leo, Day 2

Today we finished our code from yesterday, which helped us sort, categorize, and find various trends throughout consumer feedback Lenovo received. We were able to find keywords in the reviews with our program and determine the frequency of the keywords, which were then used to analyze the review score and satisfaction regarding instances of those words to find possible correlations between the words and customer satisfaction (or dissatisfaction). Using these methods, we were able to iterate through the 1000 reviews that were given to us and find the similarities and trends between them. We presented our findings to the WEP host.

Here is a small snippet of the report of our findings of a 299 sample size:

Skip to toolbar