GitHub - kunalPisolkar24/IR_Lab: Collection of practical codes for Savitribai Phule Pune University's Information Retrieval Lab (410247) .

🔍 Information Retrieval Laboratory - SPPU 🔍

Welcome to the repository for the Laboratory Practice IV (410247) course, focusing on Information Retrieval, part of the Fourth Year Computer Engineering curriculum (2019 Course) at Savitribai Phule Pune University. This repository provides practical implementations and resources to help you explore fundamental information retrieval concepts, indexing techniques, evaluation methods, and advanced applications in multimedia and parallel/distributed environments.

🏛️ Course Information:

Feature	Description
University	Savitribai Phule Pune University
Course Name	Laboratory Practice IV (410247)
Credit	01
Practical Sessions	02 Hours/Week
Examination Scheme	Term Work: 50 Marks

🎯 Learning Objectives:

Understand the basic concepts and principles of Information Retrieval (IR).
Study different indexing techniques used in IR systems.
Analyze the performance of information retrieval systems using advanced techniques like classification, clustering, and filtering, especially in multimedia contexts.
Learn about various evaluation methods for IR systems.
Understand the challenges and considerations for scaling basic IR systems into large-scale search services.
Explore parallel information retrieval and web structures.

💡 Course Outcomes:

Upon successful completion of this laboratory course, students will be able to:

CO1: Implement basic information retrieval concepts and build a simple IR system.
CO2: Develop techniques to improve the quality and relevance of retrieved information.
CO3: Apply advanced techniques like classification, clustering, and filtering to analyze multimedia information.
CO4: Evaluate and analyze the performance of information retrieval systems using appropriate metrics.
CO5: Understand the role of information retrieval in various applications and extensions.
CO6: Gain insights into parallel information retrieval techniques and web structures.

📂 Practical Implementations:

Practical No.	Description
1	Text Document Similarity: Write a program to compute the similarity between two text documents using techniques like cosine similarity or Jaccard index.
2	PageRank Algorithm: Implement the PageRank algorithm, a fundamental algorithm used by search engines to rank web pages.
3	Text Pre-processing (Stop Word Removal): Write a program to pre-process a text document by removing stop words (common words like "the," "a," "is," etc.) to improve retrieval efficiency.
4	Character Count (MapReduce): Write a MapReduce program to count the number of occurrences of each alphabetic character in a given dataset. The count should be case-insensitive and ignore non-alphabetic characters.
5	Word Count (MapReduce): Write a MapReduce program to count the number of occurrences of each word in a given dataset. The count should be case-insensitive.

🚀 Getting Started:

Navigate to the relevant practical implementation directory for instructions, code examples, and dataset details (if applicable).

🙌 Contributions:

Contributions, improvements, and feedback are welcome! If you have any enhancements, bug fixes, or additional examples to share, please open a pull request. Refer to the CONTRIBUTING.md file for guidelines.

📄 License:

This repository is distributed under the MIT License. You are free to use, modify, and distribute the code for educational and personal projects.

Let's explore the world of information retrieval and build efficient search systems!

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
five		five
four		four
one		one
three		three
two		two
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔍 Information Retrieval Laboratory - SPPU 🔍

About

Releases

Packages

Languages

License

kunalPisolkar24/IR_Lab

Folders and files

Latest commit

History

Repository files navigation

🔍 Information Retrieval Laboratory - SPPU 🔍

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages