An implementation of model parallel [GPT2]& [GPT3]-like models

An implementation of model parallel [GPT2]& [GPT3]-like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the [mesh-tensorflow]( library.

Related Repos

dlatk Differential Language Analysis ToolKit DLATK is an end to end human text analysis package, specifically suited for social media and social scientific applications. It is written in Python 3 and developed by the World Well-Being P

gutfeeling Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.

ParhamP Makes famous people speak whatever you wish by linking their words

anantzoid Language Modeling with Gated Convolutional Networks This is a Tensorflow implementation of Facebook AI Research Lab's paper: Language Modeling with Gated Convolutional Networks. This paper applies a convolutional approach to lang

csurfer rake-nltk RAKE short for Rapid Automatic Keyword Extraction algorithm, is a domain independent keyword extraction algorithm which tries to determine key phrases in a body of text by analyzing the frequency of word appearan

oxford-cs-deepnlp-2017 Practical 1: word2vec [Brendan Shillingford, Yannis Assael, Chris Dyer] For this practical, you'll be provided with a partially-complete IPython notebook, an interactive web-based Python computing environment that allows us to m

uclatommy Introduction Tweetfeels relies on VADER sentiment analysis to provide sentiment scores to user-defined topics. It does this by utilizing Twitter's streaming API to listen to real-time tweets around a particular t