Feature extraction from text and images.

FUNCTION

sklearn.feature_extraction

Multi-select
Status
Not started
URL
  • sklearn.feature_extraction.text: Text feature extraction.
  • sklearn.feature_extraction.image: Image feature extraction.

šŸ” Study Note: sklearn.feature_extraction

User guide.Ā See theĀ Feature extractionĀ section for further details.

Transforms lists of feature-value mappings to vectors.
Implements feature hashing, aka the hashing trick.

From images

Utilities to extract features from images.

Extracts patches from a collection of images.
Reshape a 2D image into a collection of patches.
Graph of the pixel-to-pixel connections.
Graph of the pixel-to-pixel gradient connections.
Reconstruct the image from all of its patches.

From text

Utilities to build feature vectors from text documents.

Convert a collection of text documents to a matrix of token counts.
Convert a collection of text documents to a matrix of token occurrences.
Transform a count matrix to a normalized tf or tf-idf representation.
Convert a collection of raw documents to a matrix of TF-IDF features.
‣

DictVectorizer