Extractum is a PHP library that extracts information from web pages.
-
Updated
Jul 1, 2024 - PHP
Extractum is a PHP library that extracts information from web pages.
A list of familiar American-English words.
A list of simple American-English words (revised Spache).
Go package that cleans a HTML page for better readability.
Python & command-line tool to gather text on the Web: Crawling & scraping, content extraction, metadata. TXT, Markdown, CSV & XML output.
An HTTP proxy that parses only text, links and pictures from pages reducing internet bandwidth usage, removing ads and heavy scripts
A very simple python script to strip clutters from montreal gazette readability web page for peoble with an handicap situation.
For Notion,OneNote,Bear,Yuque,Joplin。Clip anything to anywhere
A code-golfing language experience that has aspects of traditional programming languages - terse, elegant, readable.
To extract main article from given URL with Node.js
Chrome Extension to Summarize or Chat with Web Pages/Local Documents Using locally running LLMs. Keep all of your data and conversations private. 🔐
A modern reader mode and article library for your browser.
A Python library for calculating a large variety of metrics from text
ESLint plugin for John Resig-style micro template, Lodash's template, Underscore's template and EJS.
Offical OpenDyslexic browser extension
RateMyPDF is a website that helps paper form authors (particularly for court forms) improve the usability of their forms for self-represented litigants. It uses the FormFyxer library to deliver its insights.
A Telegram bot that makes webpages "readable" (Instant View & Inline Mode)
A PHP-based reading list for web articles. Maintains a searchable archive and computes lots of stats.
Add a description, image, and links to the readability topic page so that developers can more easily learn about it.
To associate your repository with the readability topic, visit your repo's landing page and select "manage topics."