High Performance Porter2 Stemmer

porter2 Porter2 implements the english Porter2 stemmer. It is written completely using finite state machines to do suffix comparison, rather than the string-based or tree-based approaches. As a result, it is 660% faster compare
Category: Golang / Natural Language Processing
Watchers: 3
Star: 44
Fork: 7
Last update: Jul 3, 2022

Related Repos

fredwu Stemmer An English (Porter2) stemming implementation in Elixir. In linguistic morphology and information retrieval, stemming is the process of reducin

ynqa Word Embeddings in Go wego is the implementations from scratch for word embeddings (a.k.a word representation) models in Go. What's word embeddings? W

naver SQLova SQLova is a neural semantic parser translating natural language utterance to SQL query. The name is originated from the name of our department: Search & QLova (Search & Clova). Authors Wonseok H

vodkabears yaspell Yaspell checks spelling of different texts with Yandex.Speller API. The tool is targeted to people, who works with texts and wants to get fast feedback about the quality. If you need to use API for your app, please, u

abadojack Whatlanggo Natural language detection for Go. Features Supports 84 languages 100% written in Go No external dependencies Fast Recognizes not only a language, but also a script (Latin, Cyrillic, etc)

shixzie nlp nlp is a general purpose any-lang Natural Language Processor that parses the data inside a text and returns a filled model Supported types int int8 int16 int32 int64 uint uint8 uint16 uint32 uint64

advancedlogic go-freeling Natural Language Processing in GO This is a partial port of Freeling 3.1 (http://nlp.lsi.upc.edu/freeling/). License is GPL to respect the License model of Freeling. This is the list of features already implemented

alexandres LexVec This is an implementation of the LexVec word embedding model (similar to word2vec and GloVe) that achieves state of the art results in multiple NLP tasks, as described in these papers. Pre-trained Vectors

danieldk dpar - Decaffeinated/Dependency Parser Introduction dpar is a transition-based dependency parser in Go. The goal of dpar is threefold: To provide a fast, traditional, feature-based, dependency parser. To provid

nicksnyder go-i18n go-i18n is a Go package and a command that helps you translate Go programs into multiple languages. Supports pluralized strings for all 200+ languages in the Unicode Common Locale Data Repository (CLDR). C

nuance GNLP A few structures for doing NLP analysis / experiments. Basics counter.Counter A map-like data structure for representing discrete probability distributions. Contains an underlying map of event -> probabi

rjohnsondev Go (golang) bindings for libstemmer This simple library provides Go (golang) bindings for the snowball libstemmer library including the popular porter and porter2 algorithms. Requirements You'll need the development p

danieldk Introduction This is a package for reading word2vec vectors in Go and finding similar words and analogies. Installation This package can be installed with the go command: go get gopkg.in/danieldk/go2vec.v1 To i

fiam Unicode transliterator (also known as unidecode) for Go Use the following command to install gounidecode go get -u github.com/fiam/gounidecode/unidecode Example usage package main import ( "fmt" "github.com/

goodsign About Cgo binding for libtextcat C library. Guaranteed compatibility with version 2.2. Installation Installation consists of several simple steps. They may be a bit different on your target system (e.g. require more p