A high-level distributed crawling framework.

Cola: high-level distributed crawling framework Overview Cola is a high-level distributed crawling framework, used to crawl pages and extract structured data from websites. It provides simple and fast yet flexible

Related Repos



lobstrio shadow-useragent shadow-useragent gives you access to the most commonly used UserAgents on the Internet, safe from outdated data. Behold, the power of UserAgent: >>> import shadow_useragent >>> ua = shadow_u
 

maxhumber About gazpacho is a web scraping library. It replaces requests and BeautifulSoup for most projects. gazpacho is small, simple, fast, and consistent. You should use it! Usage gazpacho
 

shaohua0116 Crawl and Visualize ICLR 2020 OpenReview Data Descriptions This Jupyter Notebook contains the data crawled from ICLR 2020 OpenReview webpages and their visualizations. The list of submissions (sorted by the average
 

0x0ptim0us Twitter High level scraper for humans.
 

mikf Command-line program to download image-galleries and -collections from several image hosting sites
 

Jodagito YouTube Downloader V1.1 Requirements python>=3.4 pytube3==9.5.13 Installation Open a terminal and execute git clone https://github.com/Jodagito/YoutubeDownloader on the destination folder. Once git fin
 

luisvonmuller Spotify Playlist Downloader Downloads songs as listed on a spotify playlist from youtube. Uses scrapy, selenium (need chrome webdriver) and youtube_dl. It also uses a library called ffmpeg to convert from webm (without video) to
 

dvingerh Python script to download videos from a TikTok profile without any watermarks.