Asura Crow: Search Engine

Asura Crow is a project that replicates the functionality of a basic search engine. It crawls, indexes, and searches web pages, and is implemented with C++, Go, and MongoDB. Ranking algorithms such as TF-IDF and BM25 are applied to improve the search results.

Components

Crawler

Description: Written in Go, the crawler fetches web pages and saves the extracted information to MongoDB for indexing.

Indexer

Description: Implemented in C++, the indexer reads the crawled data from MongoDB, indexes it, and stores the index for quick retrieval during searches.
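To make the indexing step concrete, here is a minimal sketch of an inverted index, the structure search engines commonly use for quick retrieval. It is an assumption that the project's indexer builds something similar. The sketch is written in Go for brevity even though the indexer itself is in C++, it works on in-memory strings rather than MongoDB documents, and `Posting`, `tokenize`, and `buildIndex` are hypothetical names.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
	"unicode"
)

// Posting records how often a term occurs in one document.
type Posting struct {
	DocID string
	Count int
}

// tokenize lowercases text and splits it on every rune
// that is neither a letter nor a digit.
func tokenize(text string) []string {
	return strings.FieldsFunc(strings.ToLower(text), func(r rune) bool {
		return !unicode.IsLetter(r) && !unicode.IsDigit(r)
	})
}

// buildIndex maps each term to the documents that contain it.
// Postings are sorted by DocID so the output is deterministic.
func buildIndex(docs map[string]string) map[string][]Posting {
	counts := make(map[string]map[string]int)
	for id, text := range docs {
		for _, term := range tokenize(text) {
			if counts[term] == nil {
				counts[term] = make(map[string]int)
			}
			counts[term][id]++
		}
	}
	index := make(map[string][]Posting, len(counts))
	for term, perDoc := range counts {
		postings := make([]Posting, 0, len(perDoc))
		for id, n := range perDoc {
			postings = append(postings, Posting{DocID: id, Count: n})
		}
		sort.Slice(postings, func(i, j int) bool {
			return postings[i].DocID < postings[j].DocID
		})
		index[term] = postings
	}
	return index
}

func main() {
	// Placeholder documents standing in for crawled pages.
	docs := map[string]string{
		"doc1": "Go crawler stores pages in MongoDB",
		"doc2": "The indexer reads pages; pages are indexed",
	}
	index := buildIndex(docs)
	fmt.Println(index["pages"]) // prints [{doc1 1} {doc2 2}]
}
```

With this layout a query only has to look up its own terms instead of scanning every document, and the per-document counts are the term frequencies that ranking functions need.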

Searcher

Description: Queries the MongoDB database and returns the results most relevant to a query, ranked with algorithms such as TF-IDF and BM25.
