from gensim import corpora, models, similarities
from sklearn.datasets import fetch_20newsgroups as getData
from sklearn.model_selection import train_test_split
from collections import defaultdict
import re
import numpy as np
corpus = getData(subset='train', remove=('headers','footers','quotes'))
X = corpus.data
Y = corpus.target
Y_names = corpus.target_names
print(X[0],len(X),Y_names[Y[0]])
# Output (first document, corpus size, label of the first document):
# I was wondering if anyone out there could enlighten me on this car I saw the other day.
# It was a 2-door sports car, looked to be from the late 60s/ early 70s. It was called a Bricklin.
# The doors were really small. In addition, the front bumper was separate from the rest of the body.
# This is all I know. If anyone can tellme a model name, engine specs, years of production,
# where this car is made, history, or whatever info you have on this funky looking car, please e-mail.
# 11314 rec.autos
X_train,X_test,Y_train,Y_test = train_test_split(X,Y,test_size=0.2)
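# Small hand-picked stop list; tokens in it are dropped during tokenization.
stoplist = set('for a of the and to in'.split(' '))
# Lowercase each document, split on runs of non-word characters (raw string avoids
# an invalid escape), and drop stop words plus the empty strings re.split can emit.
texts = [[word for word in re.split(r'\W+', doc.lower()) if word and word not in stoplist] for doc in X_train]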
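# A minimal sketch of the tokenization step above, applied to one short sentence
# so the result is easy to inspect. The names sample_doc, sample_stoplist and
# sample_tokens are illustrative assumptions, not part of the original script.
# re.split leaves an empty string when the text ends in punctuation, so this
# sketch filters empty tokens out explicitly.
import re

sample_doc = "The doors were really small."
sample_stoplist = set('for a of the and to in'.split(' '))
sample_tokens = [
    word
    for word in re.split(r'\W+', sample_doc.lower())
    if word and word not in sample_stoplist
]
print(sample_tokens)  # ['doors', 'were', 'really', 'small']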