Skip to content

Latest commit

 

History

History
50 lines (36 loc) · 1.5 KB

README.md

File metadata and controls

50 lines (36 loc) · 1.5 KB

goimdb (On-Hold)

IMDB movie database using Golang (only for personal and non-commercial use)

goimdb downloads data from datasets.imdbws.com, currently provided by imdb. These are the subsets of IMDb data that are available for personal and non-commercial use. Each dataset is gzipped, TSV (tab-separated-values) formatted file. The first line of each file contains headers.

goimdb downloads, extracts these file and inserts them into Solr. Solr is capable of conducting full-text search. The latest version of Solr, 7.2.1 is tested with goimdb.

Tech

goimdb uses a number of open source projects to work properly:

Installation

goimdb is go-gettable. The follwing command will download the full project:

$ go get github.com/ashik112/goimb

Solr

In Solr, a core needs be created for goimdb to work.

Create core:

$ solr create -c imdb

Current Status :

<Partially Complete: Download, Decompression, Data insertion in Solr>

<Under-development & Testing: API, User Input>

At this stage of development, goimdb only downloads and extracts the necessary files and certain folders need to be created on the root directory of the project.

    files
    files/archive
    files/decompressed
    files/json

File download, decompression, data insertion are managed by goimdb. Some parts of the project are hard-coded which will be refactored and fixed soon.

License

Apache-2.0