A very simple news crawler with a funny name
Updated Dec 23, 2024 - Python
This demo repository illustrates how to use Python to scrape news articles from Google based on a given keyword. The scraped articles are then processed by the GPT-3 model in Azure OpenAI Service (AOAI), which generates concise summaries of the main points. The summaries are then formatted and sent via email using the MailJet API.
Online web app that scrapes news from various news portals in Nepal and worldwide. Hosted on Heroku.
GNewsScraper is a TypeScript package that scrapes article data from Google News based on a keyword or phrase. It returns the results as an array of JSON objects, making it convenient to access and use the scraped information.
This is a demo repo that shows how to use the Python Requests and Beautiful Soup 4 libraries to search and scrape result data, and to save the scraping results into a Pandas DataFrame.
Implements what this video describes: https://www.youtube.com/watch?v=BVizDqOfins
Code to scrape news articles from the website of the newspaper Times of India
News scraping using Python concurrency
News Scraping Using Python - extracts data from various websites
This web app scrapes news from the top media houses in Kenya. It runs sentiment analysis on the news to determine whether each item is positive, negative, or neutral, and then plots graphs from the data.
Web scraping news using BeautifulSoup
This project implements a Document Retrieval System that integrates GPT-3.5-turbo for query expansion and answer generation. It fetches and ranks documents based on user queries, using MongoDB for document storage, Redis for caching, and web scraping to keep documents updated. The system is designed to provide fast and accurate search results.
Arachnio client library for Java 11+
Arachnio client library for Python 3.10+