habitat.datasets.utils.VocabFromText class

Methods

def get_size(self)
def get_unk_index(self)
def get_unk_token(self)
def idx2word(self, n_w)
def token_idx_2_string(self, tokens: typing.Iterable[int]) -> str
def tokenize_and_index(self, sentence, regex = re.compile('([^\\w-]+)'), keep = "'s", remove = (',', '?')) -> typing.List[int]
def word2idx(self, w)

Special methods

def __format__(self, format_spec, /): Default object formatter.
def __init__(self, sentences, min_count = 1, regex = re.compile('([^\\w-]+)'), keep = (), remove = (), only_unk_extra = False)
def __len__(self)

Data

DEFAULT_TOKENS = ['<pad>', '<unk>', '<s>', '</s>']
END_TOKEN = '</s>'
PAD_TOKEN = '<pad>'
START_TOKEN = '<s>'
UNK_TOKEN = '<unk>'

Method documentation

def habitat.datasets.utils.VocabFromText.format(self, format_spec, /)

Default object formatter.

Return str(self) if format_spec is empty. Raise TypeError otherwise.

Tab / T to search, Esc to close

…

Search for modules, classes, functions and other symbols. You can omit any prefix from the symbol path; adding a . suffix lists all members of given symbol.

Use ↓ / ↑ to navigate through the list, Enter to go. Tab autocompletes common prefix, you can copy a link to the result using ⌘ L while ⌘ M produces a Markdown link.

Sorry, nothing was found.
Maybe try a full-text search with external engine?