Project Title

Multi-Dimensional Search

Project Description

Consider the website of a seller like Amazon.
They carry tens of thousands of products, and each product has many attributes (Name, Size, Description, Keywords, Manufacturer, Price, etc.).
The search engine lets users specify attributes of the products they are seeking and shows products that have most of those attributes. To make search efficient, the data is organized using appropriate data structures, such as balanced trees. But if products are organized by Name, how can search by Price be implemented efficiently? The solution, called indexing in databases, is to create a new set of references to the objects for each search field and to organize those references so that search operations on that field are efficient. As the objects change, these access structures have to be kept consistent.
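
The indexing idea described above can be sketched in Java as two maps that hold references to the same objects. This is a minimal illustration under stated assumptions, not the project's MDS class: the names IndexSketch, Item, add, remove, and countWithWord are hypothetical, and prices are assumed to be stored as whole cents.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

// Hypothetical sketch: two access structures that reference the same Item objects.
class IndexSketch {
    static final class Item {
        final long id;
        long priceInCents;                          // assumption: price kept as whole cents
        final Set<Long> description = new HashSet<>();

        Item(long id, long priceInCents) {
            this.id = id;
            this.priceInCents = priceInCents;
        }
    }

    // Primary index: items ordered by id (a balanced tree).
    private final TreeMap<Long, Item> byId = new TreeMap<>();
    // Secondary index: description number -> items whose description contains it.
    private final Map<Long, Set<Item>> byWord = new HashMap<>();

    // Adds a new item to both indexes; returns false if the id already exists.
    boolean add(long id, long priceInCents, long... words) {
        if (byId.containsKey(id)) {
            return false;
        }
        Item item = new Item(id, priceInCents);
        byId.put(id, item);
        for (long w : words) {
            item.description.add(w);
            byWord.computeIfAbsent(w, k -> new HashSet<>()).add(item);
        }
        return true;
    }

    // Removes the item from both indexes so they stay consistent.
    boolean remove(long id) {
        Item item = byId.remove(id);
        if (item == null) {
            return false;
        }
        for (long w : item.description) {
            Set<Item> bucket = byWord.get(w);
            bucket.remove(item);
            if (bucket.isEmpty()) {
                byWord.remove(w);
            }
        }
        return true;
    }

    // Number of items whose description contains w.
    int countWithWord(long w) {
        Set<Item> bucket = byWord.get(w);
        return bucket == null ? 0 : bucket.size();
    }
}
```

Every update touches both maps in the same method, which is one simple way to keep the access structures consistent as objects change.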

In this project, each object has 3 attributes: id (long int), description (one or more long ints), and price (dollars and cents). The following operations are supported:

a. Insert(id,price,list): insert a new item whose description is given in the list. Returns 1 if the item is new, and 0 otherwise.

b. Find(id): return price of item with given id (or 0, if not found).

c. Delete(id): delete item from storage. Returns the sum of the long ints that are in the description of the deleted item (or 0, if no such id exists).

d. FindMinPrice(n): given a long int n, find the items whose description contains that number (exact match with one of the long ints in the item's description), and return the lowest price among those items. Returns 0 if there is no such item.

e. FindMaxPrice(n): given a long int n, find the items whose description contains that number, and return the highest price among those items. Returns 0 if there is no such item.

f. FindPriceRange(n,low,high): given a long int n, return the number of items whose description contains n and whose price falls within the range [low, high].

g. PriceHike(l,h,r): increase the price of every product whose id is in the range [l,h] by r%. Returns the sum of the net increases of the prices.

h. RemoveNames(id, list): Remove elements of list from the description of id. It is possible that some of the items in the list are not in the id's description. Returns the sum of the numbers that are actually deleted from the description of id. Returns 0 if there is no such id.
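
As one illustration of the operations listed above, PriceHike can be sketched as a range walk over a tree ordered by id. This is a sketch under assumptions, not the project's code: the class PriceHikeSketch and its methods are hypothetical, prices are held as whole cents, the percentage is taken to be an integer, and fractions of a cent are dropped, which matches the figures in the sample input (19.97 raised by 10% becomes 21.96).

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch of PriceHike; prices are held as whole cents in a long.
class PriceHikeSketch {
    private final TreeMap<Long, Long> priceInCentsById = new TreeMap<>();

    void put(long id, long priceInCents) {
        priceInCentsById.put(id, priceInCents);
    }

    // Price of the item in cents, or 0 if the id is unknown.
    long priceOf(long id) {
        return priceInCentsById.getOrDefault(id, 0L);
    }

    // Raises the price of every id in [low, high] by percent and
    // returns the sum of the net increases, in cents.
    long priceHike(long low, long high, long percent) {
        if (low > high) {
            return 0; // subMap would reject an inverted range
        }
        long total = 0;
        for (Map.Entry<Long, Long> e
                : priceInCentsById.subMap(low, true, high, true).entrySet()) {
            // Integer division drops fractions of a cent (assumption taken from the sample).
            long increase = e.getValue() * percent / 100;
            e.setValue(e.getValue() + increase);
            total += increase;
        }
        return total;
    }

    public static void main(String[] args) {
        PriceHikeSketch s = new PriceHikeSketch();
        s.put(22, 1997);
        s.put(12, 9692);
        s.put(37, 4744);
        System.out.println(s.priceHike(10, 22, 10)); // prints 1168, i.e. 11.68
    }
}
```

Keeping prices as integer cents avoids floating-point drift when many hikes are applied in sequence.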

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

You need a Java Development Kit (JDK) to compile and run the code:

1. Download and install a JDK.
2. Set the environment variables so that the javac and java commands are available from the command prompt.

How to execute code

NOTE: When executing from the command prompt, the current working directory must be the directory that contains the ypp170130 directory.

Compile MDS.java by executing the following command:

javac ypp170130/MDS.java

Compile and run the driver

javac ypp170130/LP3Driver.java
java -Xss512m -Xms2g ypp170130/LP3Driver PATH_INPUT_FILE

Example:

java -Xss512m -Xms2g ypp170130/LP3Driver ypp170130/input/401.txt

NOTE: Input files are in the ypp170130/input/ folder.

Input Specification

Initially, the store is empty, and there are no items. The input contains a sequence of lines (test sets can have millions of lines). Lines starting with "#" are comments. Every other line holds one operation: the name of the operation, followed by the parameters needed for that operation, separated by spaces. Lines with an Insert operation end with a "0" that terminates the description list and is not part of the description. The output is a single number: the sum of the values returned by the operations as the input is processed. In the sample below, a returned price contributes only its whole-dollar part to this total (for example, a return value of 11.68 adds 11).
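
The line format described above could be handled along the following lines. This is a hedged sketch, not the LP3Driver shipped with the project: InputLineSketch and all of its method names are hypothetical, and storing prices as whole cents is an assumption.

```java
import java.math.BigDecimal;
import java.math.RoundingMode;
import java.util.Arrays;

// Hypothetical helpers for reading one input line at a time.
class InputLineSketch {
    // Comment lines start with "#".
    static boolean isComment(String line) {
        return line.startsWith("#");
    }

    // Splits an operation line on runs of whitespace, ignoring trailing blanks.
    static String[] tokens(String line) {
        return line.trim().split("\\s+");
    }

    // Converts a price such as "19.97" into 1997 cents without floating point.
    static long toCents(String price) {
        return new BigDecimal(price)
                .setScale(2, RoundingMode.DOWN)
                .unscaledValue()
                .longValueExact();
    }

    // For an Insert line (Insert id price d1 ... dk 0), returns d1 ... dk,
    // dropping the terminating "0".
    static long[] descriptionOf(String[] insertTokens) {
        return Arrays.stream(insertTokens, 3, insertTokens.length - 1)
                .mapToLong(Long::parseLong)
                .toArray();
    }

    // Whole-dollar part of an amount in cents, as added to the running total.
    static long wholeDollars(long cents) {
        return cents / 100;
    }
}
```

For example, tokens("Insert 22 19.97 475 1238 9742 0") yields seven tokens, of which descriptionOf keeps 475, 1238, and 9742.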

Sample Input:

Insert 22 19.97 475 1238 9742 0
# New item with id=22, price="$19.97", name="475 1238 9742"
# Return: 1
#
Insert 12 96.92 44 109 0
# Second item with id=12, price="96.92", name="44 109"
# Return: 1
#
Insert 37 47.44 109 475 694 88 0
# Another item with id=37, price="47.44", name="109 475 694 88"
# Return: 1
#
PriceHike 10 22 10
# 10% price increase for id=12 and id=22
# New price of 12: 106.61, Old price = 96.92.  Net increase = 9.69
# New price of 22: 21.96.  Old price = 19.97.  Net increase = 1.99
# Return: 11.68  (sum of 9.69 and 1.99).  Added to total: 11
#
FindMaxPrice 475        
# Return: 47.44 (id of items considered: 22, 37).  Added to total: 47
#
Delete 37
# Return: 1366 (=109+475+694+88)
#
FindMaxPrice 475        
# Return: 21.96 (id of items considered: 22).  Added to total: 21
#


Output:
1448
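
The sample output can be checked by hand from the "Return" comments in the sample input. The sketch below only redoes that arithmetic; the class name SampleTotalSketch is hypothetical, and the values are copied from the sample, with prices written in cents.

```java
// Hypothetical check of the sample total; values copied from the sample's "Return" comments.
class SampleTotalSketch {
    static long sampleTotal() {
        long[] wholeNumbers = {1, 1, 1, 1366};      // three Inserts and the Delete
        long[] pricesInCents = {1168, 4744, 2196};  // PriceHike and the two FindMaxPrice calls
        long total = 0;
        for (long v : wholeNumbers) {
            total += v;
        }
        for (long p : pricesInCents) {
            total += p / 100; // only the whole-dollar part is added to the total
        }
        return total;
    }

    public static void main(String[] args) {
        System.out.println(sampleTotal()); // prints 1448
    }
}
```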

Authors