Speech and Music Detection
Python framework for Speech and Music Detection using Keras.
This repository contains the experiments presented in the paper "Temporal Convolutional Networks for Speech and Music Detection in
An Tensorflow implementation of PersonLab for Multi-Person Pose Estimation and Instance Segmentation. Identify every person instance, localize its facial and body keypoints, and estimate its instance segmentat
+ March 27: Released v1.1 with new and improved
+ functionality for image retrieval, object detection,
+ keypoint detection and action recognition.
+ For additional details, please refer to our releases page.
ceevee (read like CV, i.e. computer vision) is a Python library for various computer vision problems with a focus on easy usage.
ceevee aims to be a bridge between deep learning practitioners training accurate
HD CelebA Cropper
CelebA dataset provides an aligned set img_align_celeba.zip. However, the size of each aligned image is 218x178, so the faces cropped from such images would be even smaller!
Here we provide a code to
This is a desktop demo for the following paper. If you find the code useful, please cite the paper.
MoSculp: Interactive Visualization of Shape and Time Xiuming Zhang, Tali D
Cytokit is a collection of tools for quantifying and analyzing properties of individual cells in large fluorescent microscopy datasets with a focus on those generated from multiplexed staining protocols. This
PyColorPalette is a Python 3 tool capable of pulling a list of the top colors, or the color at a specific index, from a given image through the process of K-means clustering. Images can be provided either
Video Frame Synthesis using Deep Voxel Flow
We address the problem of synthesizing new video frames in an existing video, either in-between existing frames (interpolation), or subsequent to them (extrapolation). Our met
Who The Hill
What is Who The Hill?
Shazam, but for House members faces.
Who The Hill is an MMS-based facial recognition service for members of Congress. Reporters covering Congress can text pictures of membe
Sharingan is a tool built on Python 3.6 using OpenCV 3.2 to extract news content as text from newspaper’s photo and perform news context extraction.
For more details and explanation, please refer the blog
PyOCR is an optical character recognition (OCR) tool wrapper for python. That is, it helps using OCR tools from a Python program.
It has been tested only on GNU/Linux systems. It should also work on similar syste
A simple, Pillow-friendly, wrapper around the tesseract-ocr API for Optical Character Recognition (OCR).
tesserocr integrates directly with Tesseract's C++ API using Cython which allows for a simple Pyt