Data augmentation for NLP, accepted at EMNLP 2021 Findings

AEDA: An Easier Data Augmentation Technique for Text Classification

Related Repos

kavgan Phrase-At-Scale Phrase-At-Scale provides a fast and easy way to discover phrases from large text corpora using PySpark. Here's an example of phrases extracted from a review dataset: Features Discover most co

EmilyAlsentzer clinicalBERT Repository for Publicly Available Clinical BERT Embeddings (NAACL Clinical NLP Workshop 2019) Using Clinical BERT UPDATE: You can now use ClinicalBERT directly through the transformers library. Check out

MILVLG OpenVQA OpenVQA is a general platform for visual question answering (VQA) research, implementing state-of-the-art approaches (e.g., BUTD, MFH, BAN and MCAN) on benchmark datasets such as VQA-v2, GQA and CLEVR

rusiaaman XLnet-gen Generate language using XLNet. This is not an official implementation. Samples are included at the end of this README as well as in the samples folder. Medium article as a summary of this effort:

huggingface transformers State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0 🤗 Transformers (formerly known as pytorch-transformers and pytorch-pretrained-bert) provides state-of-the-art general-pur

uber-archive This is the Plato Dialogue System, a flexible platform for developing conversational AI agents.

deepset-ai FARM (Framework for Adapting Representation Models) What is it? FARM makes Transfer Learning with BERT & Co simple, fast and enterprise-ready. It's built upon transformers and provides additional features to simpl