sciquence.text_processing.Word2Idx

class sciquence.text_processing.Word2Idx

Class used for transforming text data into word indices.

__init__()

x.__init__(…) initializes x; see help(type(x)) for signature

Methods

__init__()               x.__init__(…) initializes x; see help(type(x)) for signature
fit(X[, y])              Fit WordEncoder object
fit_transform(X[, y])    Fit WordEncoder and transform list of tokenized sentences (or raw text) into lists of indices
inverse_transform(X)
partial_fit(X[, y])      Partially fit WordEncoder to the given word set
transform(X[, y])        Transform list of tokenized sentences (or raw text) into lists of indices
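
Example

The method list above follows the usual fit/transform pattern. The sketch below illustrates that pattern with a hypothetical stand-in class, SimpleWord2Idx, which is not the sciquence implementation. It assumes that indices are assigned in order of first appearance, that unknown words raise a KeyError, that inverse_transform maps indices back to words, and that input is already tokenized (raw text is not handled here). The actual behaviour of Word2Idx may differ on each of these points.

```python
class SimpleWord2Idx:
    """Hypothetical stand-in mirroring the method names listed above.

    Not the sciquence implementation: index order, unknown-word handling
    and the meaning of inverse_transform are assumptions for illustration.
    """

    def __init__(self):
        self.word2idx = {}  # word -> integer index
        self.idx2word = []  # integer index -> word

    def partial_fit(self, X, y=None):
        # Extend the vocabulary without discarding words seen earlier.
        for sentence in X:
            for word in sentence:
                if word not in self.word2idx:
                    self.word2idx[word] = len(self.idx2word)
                    self.idx2word.append(word)
        return self

    def fit(self, X, y=None):
        # Start from an empty vocabulary, then learn it from X.
        self.word2idx, self.idx2word = {}, []
        return self.partial_fit(X)

    def transform(self, X, y=None):
        # Assumption: a word missing from the vocabulary raises KeyError.
        return [[self.word2idx[word] for word in sentence] for sentence in X]

    def fit_transform(self, X, y=None):
        return self.fit(X).transform(X)

    def inverse_transform(self, X):
        # Assumption: maps lists of indices back to lists of words.
        return [[self.idx2word[index] for index in sentence] for sentence in X]


sentences = [["the", "cat", "sat"], ["the", "dog", "sat"]]
encoder = SimpleWord2Idx()
indices = encoder.fit_transform(sentences)
print(indices)  # [[0, 1, 2], [0, 3, 2]]
print(encoder.inverse_transform(indices) == sentences)  # True
```

In this sketch, fit resets the vocabulary while partial_fit only extends it, so indices assigned earlier stay stable when new words arrive later.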