AI4I-3-Q: Exploratory Data Analysis Quiz - could not derive at the answer
I am not able to arrive at the answer for the last question of this quiz:
You notice that there are some unstructured text comments in
‘Q9: OTHER COMMENTS’.
You are curious how many sets of 2n-grams there are.
How many 2n-grams are there?
I would like to understand how the answer was derived. Grateful if someone can help advise?
The code I provided is attached. Looks same to yours but I may be wrong.
How do we get the 7047 as the answer?
cv_trigram_vec = CountVectorizer(max_features=100, stop_words='english', ngram_range = (2,2))
# Fit and apply trigram vectorizer
cv_trigram = cv_trigram_vec.fit_transform(df['Q9: OTHER COMMENTS'].apply(lambda x: np.str_(x)))