GrowthEngineAI/newspaper4k

newspaper4k

A modified version of newspaper, the news extraction library, with async support and a focus on performance.


Quick Start

pip install git+https://github.com/GrowthEngineAI/newspaper4k

import anyio
# The package is imported as `newz`.
from newz import AsyncArticle, async_build

async def test_article():
    # Download, parse, and summarize a single article.
    url = 'https://github.blog/2022-06-06-introducing-github-skills/'
    article = AsyncArticle(url)

    await article.async_build()
    print('ARTICLE SUMMARY\n', article.summary)

    print('ARTICLE TEXT\n', article.text)

async def test_build():
    # Build a news source and list the article URLs it discovers.
    url = 'https://www.cnn.com'
    cnn_paper = await async_build(url)

    for article in cnn_paper.articles:
        print(article.url)

    # Articles from a source are built individually before their text is read.
    cnn_article = cnn_paper.articles[0]
    await cnn_article.async_build()
    print(cnn_article.text)


async def run_test():
    await test_article()
    await test_build()

if __name__ == '__main__':
    anyio.run(run_test)
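
The Quick Start builds one article at a time. Since the point of the async API is performance, a natural next step is to build several articles concurrently. The sketch below shows one way to do that with a bounded number of in-flight builds. It is not part of this repository: `StubArticle`, `build_all`, and `max_concurrency` are hypothetical names, and `StubArticle` is a stand-in that only mimics the `async_build()` coroutine and `text` attribute used above, so the example runs without network access. It uses the standard-library `asyncio` rather than `anyio`. Whether real `AsyncArticle` objects behave well under this pattern is an assumption that has not been checked here.

```python
import asyncio


class StubArticle:
    """Hypothetical stand-in for AsyncArticle (assumption: same async_build()/text shape)."""

    def __init__(self, url):
        self.url = url
        self.text = ''

    async def async_build(self):
        # Simulate the latency of a download-and-parse step.
        await asyncio.sleep(0.01)
        self.text = f'text of {self.url}'


async def build_all(articles, max_concurrency=5):
    """Build all articles concurrently, with at most max_concurrency builds in flight."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def build_one(article):
        async with semaphore:
            await article.async_build()
        return article

    # return_exceptions=True puts failures in the result list
    # instead of raising the first one; results keep input order.
    return await asyncio.gather(
        *(build_one(article) for article in articles),
        return_exceptions=True,
    )


if __name__ == '__main__':
    urls = [f'https://example.com/story-{i}' for i in range(10)]
    results = asyncio.run(build_all([StubArticle(url) for url in urls]))
    for result in results:
        if isinstance(result, Exception):
            print('FAILED:', result)
        else:
            print(result.url, len(result.text))
```

To try this against a real source, the list passed to `build_all` would be something like `cnn_paper.articles[:10]` from the Quick Start, called from inside an already running coroutine with `await build_all(...)`.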
