Efficient Image Captioning code in Torch, runs on GPU

NeuralTalk2 Update (September 22, 2016): The Google Brain team has released the image captioning model of Vinyals et al. (2015). The core model is very similar to NeuralTalk2 (a CNN followed by RNN), but the Google release should

Related Repos



twitter torch-twrl: Reinforcement Learning in Torch torch-twrl is an RL framework built in Lua/Torch by Twitter. Installation Install torch git clone https://github.com/torch/distro.git ~/torch --recursive cd ~/torch; ba
 

karpathy NeuralTalk2 Update (September 22, 2016): The Google Brain team has released the image captioning model of Vinyals et al. (2015). The core model is very similar to NeuralTalk2 (a CNN followed by RNN), but the Google release should
 

forence This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
 

Atcold Torch Video Tutorials Light your way in Deep Learning with Torch 🔦 This aims to be a growing collections of introductory video tutorials on the Torch ecosystem. Torch is one of the fastest and most flexible frame
 

zhjohnchan Awesome Image Captioning A curated list of image captioning and related area. :-) Contributing Please feel free to send me pull requests or email ([email protected]) to add links. Markdown format: - [Pap
 

audio-captioning Audio captioning is a novel and exciting research direction, focusing on the automatic generation of textual descriptions (i.e. captions) for general audio. This repository is a list of papers that are focusing on audio captioning.
 

Shreyz-max Video-Captioning Video Captioning is an encoder decoder mode based on sequence to sequence learning. It takes a video as input and generates a caption
 

paraschopra Four-in-one deep network: image search, image captioning, similar words and similar images using a single model
 

bamos setGPU A small Python library that automatically sets CUDA_VISIBLE_DEVICES to the least-loaded GPU on multi-GPU systems. Installation: pip install setGPU Usage: import setGPU before any import that will use a GPU like torch
 

ashnkumar SketchCode Generating HTML Code from a hand-drawn wireframe SketchCode is a deep learning model that takes hand-drawn web mockups and converts them into working HTML code. It uses an image captioning architecture to generat
 

luo3300612 This repository contains the reference code for the paper Duel-Level Collaborative Transformer for Image Captioning.
 

Yijunmaverick CartoonGAN-Test-Pytorch-Torch Pytorch and Torch testing code of CartoonGAN [Chen et al., CVPR18]. With the released pretrained models by the authors, I made these simple scripts for a quick test. Getting started
 

pokerfaceSad GPU Mounter is a kubernetes plugin which enables add or remove GPU resources for running Pods. This Introduction(In Chinese) is recommended to read which can help you understand what and why is GPU Mounter.
 

sgrvinod This is a PyTorch Tutorial to Image Captioning. This is the first in a series of tutorials I'm writing about implementing cool models on your own with the amazing PyTorch library. Basic knowledge of PyTorch, convolutional and recurrent ne
 

LuoweiZhou Vision-Language Pre-training for Image Captioning and Question Answering
 

krasserm Transformer-based image captioning extension of pytorch/fairseq
 

ajamjoom Image Captioning System This repository presents a pyTorch implementation of the Show, Attend, and Tell paper (https://arxiv.org/pdf/1502.03044.pdf) and applies two extentions to it: (1) utalize the GloVe embeddings and (2) integ
 

torch Development Status Torch is not in active developement. The functionality provided by the C backend of Torch, which are the TH, THNN, THC, THCUNN libraries is actively extended and re-written in the ATen C++11 library (source,
 

fonfonx Wasserstein GAN This repository provides a Torch implementation of Wasserstein GAN as described by Arjovsky et. al. in their paper Wasserstein GAN. Prerequisites Torch cutorch, cunn and cudnn to train the netwo
 

nashory gans-collection.torch Torch implementation of various types of GANs (e.g. DCGAN, ALI, Context-encoder, DiscoGAN, CycleGAN, EBGAN). Note that EBGAN and BEGAN implementation is still not stable yet. I am working on this.