Natural Language Processing

Libraries for working with human languages.

Newest releases

ynqa Word Embeddings in Go wego is the implementations from scratch for word embeddings (a.k.a word representation) models in Go. What's word embeddings? W

fredwu Stemmer An English (Porter2) stemming implementation in Elixir. In linguistic morphology and information retrieval, stemming is the process of reducin

naver SQLova SQLova is a neural semantic parser translating natural language utterance to SQL query. The name is originated from the name of our department: Search & QLova (Search & Clova). Aut

vodkabears yaspell Yaspell checks spelling of different texts with Yandex.Speller API. The tool is targeted to people, who works with texts and wants to get fast feedback about the quality. If you need to use API for your app,

abadojack Whatlanggo Natural language detection for Go. Features Supports 84 languages 100% written in Go No external dependencies Fast Recognizes not only a language, but also a script (Latin, Cyrilli

shixzie nlp nlp is a general purpose any-lang Natural Language Processor that parses the data inside a text and returns a filled model Supported types int int8 int16 int32 int64 uint uint8 uint16 uin

advancedlogic go-freeling Natural Language Processing in GO This is a partial port of Freeling 3.1 ( License is GPL to respect the License model of Freeling. This is the list of features already i

dchest Stemmer package for Go Stemmer package provides an interface for stemmers and includes English, German and Dutch stemmers as sub-packages: porter2 sub-package implements English (Porter2) stemmer as described in ht

pebbe A Go package for n-gram based text categorization, with support for utf-8 and raw text. To do: write documentation make it faster Keywords: text categorization, language detector Install go get

goodsign Description Snowball stemmer port (cgo wrapper) for Go. Provides word stem extraction functionality. For more detailed info see Installing go get go

neurosnap Sentences - A command line sentence tokenizer This command line utility will convert a blob of text into a list of sentences. Demo Docs Install go get go install

alexandres LexVec This is an implementation of the LexVec word embedding model (similar to word2vec and GloVe) that achieves state of the art results in multiple NLP tasks, as described in these papers. Pre-trained Vect

zentures porter2 Porter2 implements the english Porter2 stemmer. It is written completely using finite state machines to do suffix comparison, rather than the string-based or tree-based approaches. As a result, it is 660% fast

a2800276 Porter Stemmer for Go This is a fairly straighforward port of Martin Porter's C implementation of the Porter stemming algorithm. The C version this port is based on is available for download here:

fiam Unicode transliterator (also known as unidecode) for Go Use the following command to install gounidecode go get -u Example usage package main import ( "fmt" "g

blevesearch segment A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29 Features Currently only segmentation at Word Boundaries is supported. License Apache

rookii ##Golang Implementation of the Paice/Husk stemming algorithm This project was created for the QUT course INB344. Details on the algorithm can be found here. This implementation is primarily based on the ANSI C Implementationn by A

awsong MMSEGO This is a GO implementation of MMSEG which a Chinese word splitting algorithm. TO DO list Documentation/comments Benchmark Usage #Input Dictionary Format Key\tFreq Each key occ

goodsign About Cgo binding for icu4c C library detection and conversion functions. Guaranteed compatibility with version 50.1. Installation Installation consists of several simple steps. They may be a bit different o

goodsign About Cgo binding for libtextcat C library. Guaranteed compatibility with version 2.2. Installation Installation consists of several simple steps. They may be a bit different on your target system (e.g. requ

danieldk Introduction This is a package for reading word2vec vectors in Go and finding similar words and analogies. Installation This package can be installed with the go command: go get

rjohnsondev Go (golang) bindings for libstemmer This simple library provides Go (golang) bindings for the snowball libstemmer library including the popular porter and porter2 algorithms. Requirements You'll need the dev

nuance GNLP A few structures for doing NLP analysis / experiments. Basics counter.Counter A map-like data structure for representing discrete probability distributions. Contains an underlying map of event -&g

nicksnyder go-i18n go-i18n is a Go package and a command that helps you translate Go programs into multiple languages. Supports pluralized strings for all 200+ languages in the Unicode Common Locale Data Repository (CLDR).

danieldk dpar - Decaffeinated/Dependency Parser Introduction dpar is a transition-based dependency parser in Go. The goal of dpar is threefold: To provide a fast, traditional, feature-based, dependency parser.