Text normalization library for Python

normalizr Normalizr is a Python library for text normalization that offers a bunch of actions to manipulate your text as much as you want. With normalizr you can replace symbols, punctuation, remove stop words and much more.

Related Repos



aio-libs yarl Introduction Url is constructed from str: >>> from yarl import URL >>> url = URL('https://www.python.org/~guido?arg=1#frag') >>> url URL('https://www.python.org/~guido
 

Jonwing morphling Morphling is a convenient tool that converts Markdown to HTML. Usage Command Line Mode python -m morphling <markdown file> [options...] Use morphling in your code fro
 

lark-parser Lark - a modern parsing library for Python Parse any context-free grammar, FAST and EASY! Beginners: Lark is not just another parser. It can parse any grammar you throw at it, no matter how complicated or ambiguous, and do so ef
 

sloria TextBlob: Simplified Text Processing Homepage: https://textblob.readthedocs.io/ TextBlob is a Python (2 and 3) library for processing textual data. It provides a simple API for diving into common natural language processing
 

facebook Duckling Duckling is a Haskell library that parses text into structured data. "the first Tuesday of October" => {"value":"2017-10-03T00:00:00.000-07:00","grain":"day"} Requirements A Haskell environment is req
 

fxsjy jparser A readability parser which can extract title, content, images from html pages Install: pip install jparser (requirement: lxml) Usage Example: import urllib2 from jparser import PageModel html = urllib2.
 

santalu Mask EditText Sample Usage Gradle allprojects { repositories { maven { url 'https://jitpack.io' } } } dependencies { implementation 'com.github.santalu:mask-edittext:1.1.1' }
 

sevagas macro_pack Short description The macro_pack is a tool used to automatize obfuscation and generation of retro formats such as MS Office documents or VBS like format. Now it also handles various shortcuts formats. This