online natural language processing with word vectors

Install
git clone
cd words2map
./

Derive new vectors for words by searching online
from words2map import *
model = load_model()
words = load_words("passions
Category: Python / Natural Language Processing
Watchers: 23
Star: 310
Fork: 37
Last update: Jan 16, 2022

Related Repos

dakrone Clojure library interface to OpenNLP - A library to interface with the OpenNLP (Open Natural Language Processing) library

JuliaText WordTokenizers Some basic tokenizers for Natural Language Processing. Installation: As per standard Julia package installation: pkg> add WordTokenizer

JuliaText CorpusLoaders A collection of various means for loading various different corpora used in NLP. Installation As per the standard Julia package installa

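lancopku pkuseg: a multi-domain Chinese word segmentation toolkit (English Version). pkuseg is a toolkit based on the paper [Luo et. al, 2019]. It is simple and easy to use, supports domain-specific word segmentation, and effectively improves segmentation accuracy.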

machinalis (ASCII-art banner; no project description survives)

machinalis Yalign is a tool for extracting parallel sentences from comparable corpora. Statistical Machine Translation relies on parallel corpora (e.g. eur

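isnowfy SnowNLP: Simplified Chinese Text Processing. SnowNLP is a library written in Python that makes it easy to process Chinese text. It was inspired by TextBlob; because most natural language processing libraries today are aimed mainly at English, the author wrote a library that is convenient for processing Chinese, and with TextBlob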

columbia-applied-data-science Rosetta Tools for data science with a focus on text processing. Focuses on "medium data", i.e. data too big to fit into memory but too small to necess

proycon This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the Ucto tokeniser available to Python. Ucto itself is a regular-expression-based, extensible, and advanced tokeniser written in C++

proycon Frog for Python This is a Python binding to the Natural Language Processing suite Frog. Frog is intended for Dutch and performs part-of-speech tagging

chartbeat-labs textacy: NLP, before and after spaCy textacy is a Python library for performing a variety of natural language processing (NLP) tasks, built on the hig

gugarosa NALP: Natural Adversarial Language Processing Welcome to NALP. Have you ever wanted to create natural text from raw sources? If yes, NALP is for you!

GrowingGit GitHub English Top Charts 「Help you discover excellent English projects without being distracted by projects in other spoken languages.」 Features • Definition of

BlackKakapo Icelandic Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current methods: CBOW, Skip-Gram, Fast-Text (from Gensim library). The .vec and .model files are available for download (all in one archive).

dizzyliam tome A natural language library for Nim. import tome const text = """ There should be one and only one programming language for everything. That lan