SQLova
SQLova is a neural semantic parser that translates natural language utterances into SQL queries. The name originates from the name of our department: Search & QLova (Search & Clova).
Aut
yaspell
Yaspell checks the spelling of texts with the Yandex.Speller API. The tool is aimed at people who work with texts and want fast feedback on their quality. If you need to use the API for your app,
Whatlanggo
Natural language detection for Go.
Features
Supports 84 languages
100% written in Go
No external dependencies
Fast
Recognizes not only a language, but also a script (Latin, Cyrilli
nlp
nlp is a general-purpose, any-lang Natural Language Processor that parses the data inside a text and returns a filled model.
Supported types
int int8 int16 int32 int64
uint uint8 uint16 uin
go-freeling
Natural Language Processing in Go
This is a partial port of Freeling 3.1 (http://nlp.lsi.upc.edu/freeling/).
The license is GPL, in keeping with Freeling's licensing model.
This is the list of features already i
Stemmer package for Go
Stemmer package provides an interface for stemmers and includes English, German and Dutch stemmers as sub-packages:
porter2 sub-package implements English (Porter2) stemmer as described in ht
A Go package for n-gram based text categorization, with support for utf-8 and raw text.
To do:
write documentation
make it faster
Keywords: text categorization, language detector
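As an illustration of n-gram based text categorization in general, here is a minimal sketch using only the standard library. It is not this package's API: the function names `profile`, `distance` and `classify`, the n-gram sizes (1 to 3), the `_` padding and the toy training texts are all assumptions made for the example. Each text is reduced to a frequency-ranked list of character n-grams, and a document is assigned to the category whose ranked list is closest under an "out-of-place" rank distance.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// profile ranks the character n-grams (n = 1..3) of text by descending
// frequency and keeps at most limit of them. Ties are broken alphabetically
// so the result is deterministic. Working on runes keeps UTF-8 text intact.
func profile(text string, limit int) []string {
	counts := map[string]int{}
	for _, word := range strings.Fields(strings.ToLower(text)) {
		runes := []rune("_" + word + "_") // pad so word boundaries form n-grams too
		for n := 1; n <= 3; n++ {
			for i := 0; i+n <= len(runes); i++ {
				counts[string(runes[i:i+n])]++
			}
		}
	}
	grams := make([]string, 0, len(counts))
	for g := range counts {
		grams = append(grams, g)
	}
	sort.Slice(grams, func(i, j int) bool {
		if counts[grams[i]] != counts[grams[j]] {
			return counts[grams[i]] > counts[grams[j]]
		}
		return grams[i] < grams[j]
	})
	if len(grams) > limit {
		grams = grams[:limit]
	}
	return grams
}

// distance sums, for every n-gram in doc, how far its rank is from its rank
// in category. N-grams missing from category cost len(category).
func distance(doc, category []string) int {
	rank := make(map[string]int, len(category))
	for i, g := range category {
		rank[g] = i
	}
	total := 0
	for i, g := range doc {
		r, ok := rank[g]
		switch {
		case !ok:
			total += len(category)
		case r > i:
			total += r - i
		default:
			total += i - r
		}
	}
	return total
}

// classify returns the label of the training text whose profile is closest.
func classify(text string, training map[string]string, limit int) string {
	doc := profile(text, limit)
	labels := make([]string, 0, len(training))
	for label := range training {
		labels = append(labels, label)
	}
	sort.Strings(labels) // fixed order so ties resolve the same way every run
	best, bestDist := "", -1
	for _, label := range labels {
		d := distance(doc, profile(training[label], limit))
		if bestDist < 0 || d < bestDist {
			best, bestDist = label, d
		}
	}
	return best
}

func main() {
	// Toy training data (hypothetical); real profiles need far more text.
	training := map[string]string{
		"english": "the quick brown fox jumps over the lazy dog and the cat",
		"german":  "der schnelle braune fuchs springt über den faulen hund und die katze",
	}
	fmt.Println(classify("the dog and the fox", training, 300))
}
```

In practice the category profiles would be computed once from large corpora and cached, rather than rebuilt on every call as this sketch does.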
Install
go get github.com/peb
Description
Snowball stemmer port (cgo wrapper) for Go. Provides word stem extraction functionality. For more detailed info see http://snowball.tartarus.org/
Installing
go get github.com/goodsign/snowball
go
Sentences - A command line sentence tokenizer
This command line utility will convert a blob of text into a list of sentences.
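To make the task concrete, the following is a deliberately naive sentence splitter written with the standard library only. It does not reflect how this tool works internally and is not its API; the function name `splitSentences` and the boundary rule (a `.`, `!` or `?` followed by whitespace or the end of the text) are assumptions for illustration.

```go
package main

import (
	"fmt"
	"strings"
	"unicode"
)

// splitSentences cuts text after '.', '!' or '?' when the mark is followed
// by whitespace or ends the text. Marks inside tokens such as "3.14" are
// therefore not treated as boundaries.
func splitSentences(text string) []string {
	var sentences []string
	runes := []rune(text)
	start := 0
	for i, r := range runes {
		if r != '.' && r != '!' && r != '?' {
			continue
		}
		if i+1 < len(runes) && !unicode.IsSpace(runes[i+1]) {
			continue
		}
		if s := strings.TrimSpace(string(runes[start : i+1])); s != "" {
			sentences = append(sentences, s)
		}
		start = i + 1
	}
	// Keep any trailing text that has no closing punctuation.
	if s := strings.TrimSpace(string(runes[start:])); s != "" {
		sentences = append(sentences, s)
	}
	return sentences
}

func main() {
	for _, s := range splitSentences("Hello world. How are you? Fine!") {
		fmt.Println(s)
	}
}
```

A rule this simple breaks on abbreviations such as "Dr. Smith", which is the kind of case a dedicated sentence tokenizer is meant to handle.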
Demo
Docs
Install
go get gopkg.in/neurosnap/sentences.v1
go install
LexVec
This is an implementation of the LexVec word embedding model (similar to word2vec and GloVe) that achieves state of the art results in multiple NLP tasks, as described in these papers.
Pre-trained Vect
porter2
Porter2 implements the English Porter2 stemmer. It is written completely using finite state machines to do suffix comparison, rather than string-based or tree-based approaches. As a result, it is 660% fast
Porter Stemmer for Go
This is a fairly straightforward port of Martin Porter's C implementation of the Porter stemming algorithm. The C version this port is based on is available for download here: http://tartarus.org/~m
Unicode transliterator (also known as unidecode) for Go
Use the following command to install gounidecode
go get -u github.com/fiam/gounidecode/unidecode
Example usage
package main
import (
"fmt"
"g
segment
A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29
Features
Currently only segmentation at Word Boundaries is supported.
License
Apache
Golang Implementation of the Paice/Husk stemming algorithm
This project was created for the QUT course INB344. Details on the algorithm can be found here. This implementation is primarily based on the ANSI C Implementation by A
MMSEGO
This is a Go implementation of MMSEG, which is a Chinese word segmentation algorithm.
TO DO list
Documentation/comments
Benchmark
Usage
Input Dictionary Format
Key\tFreq
Each key occ
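A dictionary in the `Key\tFreq` layout could be loaded roughly as follows. This is a standard-library sketch, not MMSEGO's own loader: the function name `parseDict`, the choice to skip blank lines, to reject malformed lines, and to let a later duplicate key overwrite an earlier one are all assumptions, since the format description above is cut off.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// parseDict reads one "Key\tFreq" entry per line into a map from key to
// frequency. Blank lines are skipped; a line without a tab or with a
// non-integer frequency is reported as an error.
func parseDict(data string) (map[string]int, error) {
	dict := map[string]int{}
	for n, line := range strings.Split(data, "\n") {
		line = strings.TrimRight(line, "\r") // tolerate Windows line endings
		if strings.TrimSpace(line) == "" {
			continue
		}
		parts := strings.SplitN(line, "\t", 2)
		if len(parts) != 2 {
			return nil, fmt.Errorf("line %d: missing tab separator", n+1)
		}
		freq, err := strconv.Atoi(strings.TrimSpace(parts[1]))
		if err != nil {
			return nil, fmt.Errorf("line %d: bad frequency %q: %w", n+1, parts[1], err)
		}
		dict[parts[0]] = freq // assumption: the last occurrence of a key wins
	}
	return dict, nil
}

func main() {
	dict, err := parseDict("中国\t100\n北京\t50\n")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(len(dict), dict["中国"], dict["北京"])
}
```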
About
Cgo binding for icu4c C library detection and conversion functions. Guaranteed compatibility with version 50.1.
Installation
Installation consists of several simple steps. They may be a bit different o
About
Cgo binding for libtextcat C library. Guaranteed compatibility with version 2.2.
Installation
Installation consists of several simple steps. They may be a bit different on your target system (e.g. requ
Introduction
This is a package for reading word2vec vectors in Go and finding similar words and analogies.
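The idea behind similarity and analogy queries can be sketched without the package itself. The code below is not go2vec's API: the `embeddings` type, the function names and the two-dimensional toy vectors are assumptions chosen so the arithmetic is easy to follow. Similar words are found by cosine similarity, and an analogy "a is to b as c is to ?" is answered by looking for the word nearest to the vector b - a + c.

```go
package main

import (
	"fmt"
	"math"
	"sort"
)

// embeddings maps a word to its vector. All vectors are assumed to have
// the same dimension.
type embeddings map[string][]float64

// cosine returns the cosine similarity of a and b, or 0 for a zero vector.
func cosine(a, b []float64) float64 {
	var dot, na, nb float64
	for i := range a {
		dot += a[i] * b[i]
		na += a[i] * a[i]
		nb += b[i] * b[i]
	}
	if na == 0 || nb == 0 {
		return 0
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

// nearest returns the word most similar to query, ignoring excluded words.
// Words are visited in sorted order so ties resolve deterministically.
func nearest(emb embeddings, query []float64, exclude ...string) string {
	skip := map[string]bool{}
	for _, w := range exclude {
		skip[w] = true
	}
	words := make([]string, 0, len(emb))
	for w := range emb {
		words = append(words, w)
	}
	sort.Strings(words)
	best, bestSim := "", math.Inf(-1)
	for _, w := range words {
		if skip[w] {
			continue
		}
		if sim := cosine(query, emb[w]); sim > bestSim {
			best, bestSim = w, sim
		}
	}
	return best
}

// analogy answers "a is to b as c is to ?" with the word nearest to
// b - a + c. It returns "" if any of the three words is unknown.
func analogy(emb embeddings, a, b, c string) string {
	va, okA := emb[a]
	vb, okB := emb[b]
	vc, okC := emb[c]
	if !okA || !okB || !okC {
		return ""
	}
	query := make([]float64, len(vb))
	for i := range query {
		query[i] = vb[i] - va[i] + vc[i]
	}
	return nearest(emb, query, a, b, c)
}

func main() {
	// Toy vectors (hypothetical): axis 0 is "royalty", axis 1 is "gender".
	toy := embeddings{
		"man":   {0, 1},
		"woman": {0, -1},
		"king":  {1, 1},
		"queen": {1, -1},
		"apple": {-1, 0},
	}
	fmt.Println(analogy(toy, "man", "king", "woman")) // prints "queen"
}
```

Real word2vec models have hundreds of dimensions and large vocabularies, so a linear scan like `nearest` is only suitable for small experiments.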
Installation
This package can be installed with the go command:
go get gopkg.in/danieldk/go2vec
Go (golang) bindings for libstemmer
This simple library provides Go (golang) bindings for the Snowball libstemmer library, including the popular Porter and Porter2 algorithms.
Requirements
You'll need the dev
GNLP
A few structures for doing NLP analysis / experiments.
Basics
counter.Counter
A map-like data structure for representing discrete probability distributions. Contains an underlying map of event -&g
go-i18n
go-i18n is a Go package and a command that helps you translate Go programs into multiple languages.
Supports pluralized strings for all 200+ languages in the Unicode Common Locale Data Repository (CLDR).
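To show why pluralization needs per-language rules, here is a small standard-library sketch of CLDR-style plural categories for non-negative integers in English and Russian. It is not go-i18n's API: the function names `pluralEnglish` and `pluralRussian` and the sample word forms are assumptions for illustration, and fractional numbers are not handled.

```go
package main

import "fmt"

// pluralEnglish returns the plural category for a non-negative integer:
// "one" for exactly 1, "other" for everything else.
func pluralEnglish(n int) string {
	if n == 1 {
		return "one"
	}
	return "other"
}

// pluralRussian returns the plural category for a non-negative integer.
// Russian distinguishes "one" (1, 21, 31, ...), "few" (2-4, 22-24, ...)
// and "many" (0, 5-20, 25-30, ...).
func pluralRussian(n int) string {
	mod10, mod100 := n%10, n%100
	switch {
	case mod10 == 1 && mod100 != 11:
		return "one"
	case mod10 >= 2 && mod10 <= 4 && (mod100 < 12 || mod100 > 14):
		return "few"
	default:
		return "many"
	}
}

func main() {
	// Sample forms of the Russian word for "file", keyed by category.
	forms := map[string]string{"one": "файл", "few": "файла", "many": "файлов"}
	for _, n := range []int{1, 3, 5, 11, 21} {
		fmt.Println(n, forms[pluralRussian(n)])
	}
}
```

A translation file then only needs one string per category, and the rule picks the right one for each count.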
dpar - Decaffeinated/Dependency Parser
Introduction
dpar is a transition-based dependency parser in Go. The goal of dpar is threefold:
To provide a fast, traditional, feature-based, dependency parser.