🍭 Korean Sentence Embedding Repository

Korean-Sentence-Embedding 🍭 Korean sentence embedding repository. You can download the pre-trained models and run inference right away; it also provides
Information
Category: Python / Natural Language Processing
Watchers: 1
Star: 6
Fork: 0
Last update: Dec 30, 2021

Related Repos



youzanai T'rex Park (霸王龙公园, "T-rex Park") The Trexpark project is open-sourced by the Youzan Data Intelligence team and is the first open-source NLP and image project in China trained on e-commerce big data. We expect to gradually release NLP and image models pre-trained on NLP corpora such as product titles, reviews, and customer-service conversations, as well as on product main images, brand logos, and similar data. Why a T-rex? The T-rex is Youzan's mascot. Well, to be precise
 

quoll remorse Clojure to morse code conversion Usage Dependencies This can be included in deps.edn with the following entry in the :deps map: com.github.quo
 

dakrone Clojure library interface to OpenNLP - https://opennlp.apache.org/ A library to interface with the OpenNLP (Open Natural Language Processing) library
 

JuliaText WordTokenizers Some basic tokenizers for Natural Language Processing. Installation: As per standard Julia package installation: pkg> add WordTokenizer
 

JuliaText CorpusLoaders A collection of various means for loading various different corpora used in NLP. Installation As per the standard Julia package installa
 

lancopku pkuseg: a multi-domain Chinese word segmentation toolkit (English Version) pkuseg is a toolkit based on the paper [Luo et. al, 2019]. It is simple and easy to use, supports segmentation for specialized domains, and effectively improves segmentation accuracy.
 

machinalis quepy (ASCII-art project banner)
 

machinalis About Yalign is a tool for extracting parallel sentences from comparable corpora. Statistical Machine Translation relies on parallel corpora (e.g. eur
 

isnowfy SnowNLP: Simplified Chinese Text Processing SnowNLP is a library written in Python that makes it easy to process Chinese text. It was inspired by TextBlob; because most natural language processing libraries today are aimed at English, this library was written to make processing Chinese convenient, and, relative to TextBlob
 

columbia-applied-data-science Rosetta Tools for data science with a focus on text processing. Focuses on "medium data", i.e. data too big to fit into memory but too small to necess
 

proycon This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is a regular-expression-based, extensible, and advanced tokeniser written in C++
 

proycon Frog for Python This is a Python binding to the Natural Language Processing suite Frog. Frog is intended for Dutch and performs part-of-speech tagging
 

chartbeat-labs textacy: NLP, before and after spaCy textacy is a Python library for performing a variety of natural language processing (NLP) tasks, built on the hig
 

gugarosa NALP: Natural Adversarial Language Processing Welcome to NALP. Have you ever wanted to create natural text from raw sources? If yes, NALP is for you!
 

GrowingGit GitHub English Top Charts 「Help you discover excellent English projects and get rid of disturbing by other spoken language.」 Features • Definition of