Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.

doc2text

doc2text extracts higher quality text by fixing common scan errors.

Developing text corpora can be a massive pain in the butt. Much of the text data we are interested in as scientists are

Related Repos

zfergus TopOpt — A Python Library for Topology Optimization

jasmcaus Caer is a lightweight Computer Vision library for high-performance AI research. It simplifies your approach towards Computer Vision by abstracting away unnecessary boilerplate code, enabling maximum flexibility. By offering powerful image and video processing algorithms, Caer provides both casual and advanced users with an elegant interface for machine vision operations.

LeandroBarone Python package that converts images into ASCII art for terminals and HTML.

getsolus Budgie Desktop View is the official Budgie desktop icons application / implementation, developed by Solus.

beurtschipper Depix is a tool for recovering passwords from pixelized screenshots.

phurwicz Hover is a machine teaching library that enables intuitive and efficient supervision. In other words, it provides a map where you hover over and label your data... differently.

AhmetFurkanDEMIR Image encryption and embedding encrypted text in the image.

flomlo An HTML canvas-based screencasting server with occasional ground-truth updates via screenshots and very fast input drawing.