Real-time transportation mode detector and analyzer trained on 5 classes (bus, car, train, still, walking) to recognize human activity, based solely on two of the user's smartphone sensors: the accelerometer and the gyroscope.
Prediction accuracy of this detector is about 72%, meaning that roughly 7 out of 10 predictions identify the correct transportation mode.
```bash
$ git clone https://github.com/Alessandrus00/TMDA.git
$ cd TMDA
$ wget -P ./spark/setup https://dlcdn.apache.org/spark/spark-3.2.1/spark-3.2.1-bin-hadoop3.2.tgz
$ docker-compose up -d
```
- A container named spark-training trains a Gradient-Boosted Tree classification model, starting from the data in `/dataset/training/sensors_training_clean.csv`, which is a smaller version of `/dataset/training/dataset_5secondWindow.csv`, that can be downloaded here. The trained model is then saved to `spark/model` (a training sketch follows this list).
- At the same time, a container named zip-extractor waits for zip files containing sensor data, generated by the mobile application phyphox, to be added to `/dataset/zips`. It then extracts them and appends the raw data to `dataset/sensors.csv` (if this dataset doesn't exist, it is created); a sketch of this step also follows the list.
- An event streaming cluster is built from the broker (Kafka broker) and zookeeper containers, plus an additional one, init-kafka, that creates two Kafka topics: sensors-raw (for raw data) and sensors (for clean data).
- When `sensors.csv` contains new entries, a container named logstash, running the data ingestor Logstash, reads those entries and writes them to the sensors-raw topic.
- Once data is stored inside the sensors-raw topic, a container running Spark, named spark-cleaning and subscribed to this topic, is responsible for reading, cleaning and writing the results to the sensors topic (see the cleaning sketch after this list). Cleaning mainly consists of:
  - grouping data by user ID and session window (so that each user has a window started and ended by the events themselves);
  - aggregating sensor data in order to extract the features mean, min, max and stddev (standard deviation) for each sensor (excluding the ones that produce geo data).
- Another container running Spark, called spark-streaming, reads from the sensors topic and predicts the transportation mode using the previously trained model. The resulting DataFrame is then sent to the data indexer Elasticsearch (see the streaming sketch after this list).
- Elasticsearch runs in a dedicated container called elasticsearch and provides a fast way to search and query the data.
- Finally, a container named kibana, running the data visualization tool Kibana, provides an elegant dashboard where data coming from Elasticsearch is rendered as awesome charts. Remember to import `kibana/saved objects/kibana_objects.ndjson` into Kibana using the import button in the section Stack Management > Saved Objects.
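
As a rough idea of what the spark-training step looks like, here is a minimal PySpark sketch. It is not the repository's actual code: the label column name (`target`), the assumption that all remaining columns are numeric features, and the `maxIter` value are guesses. Since Spark's `GBTClassifier` only handles binary labels, it is wrapped in `OneVsRest` to cover the 5 classes.

```python
# Hypothetical training sketch; column names and parameters are assumptions.
from pyspark.sql import SparkSession
from pyspark.ml import Pipeline
from pyspark.ml.feature import StringIndexer, VectorAssembler
from pyspark.ml.classification import GBTClassifier, OneVsRest

spark = SparkSession.builder.appName("tmda-training").getOrCreate()

df = spark.read.csv("/dataset/training/sensors_training_clean.csv",
                    header=True, inferSchema=True)

# Map the 5 target labels (bus, car, train, still, walking) to numeric indices.
indexer = StringIndexer(inputCol="target", outputCol="label")

# Assemble the sensor features into a single vector
# (assumes every non-label column is numeric).
feature_cols = [c for c in df.columns if c != "target"]
assembler = VectorAssembler(inputCols=feature_cols, outputCol="features")

# Spark's GBTClassifier is binary-only, so wrap it in OneVsRest for 5 classes.
gbt = GBTClassifier(maxIter=50)
ovr = OneVsRest(classifier=gbt, labelCol="label", featuresCol="features")

model = Pipeline(stages=[indexer, assembler, ovr]).fit(df)
model.write().overwrite().save("spark/model")
```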
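
The zip-extractor behaviour can be pictured with a small polling loop like the sketch below; the assumption that each zip from phyphox contains one or more CSV files sharing the header of `dataset/sensors.csv` is mine, not the repo's.

```python
# Minimal sketch of the zip-extractor step; the zip layout is an assumption.
import csv, glob, io, os, time, zipfile

ZIP_DIR, OUT_CSV = "/dataset/zips", "/dataset/sensors.csv"
seen = set()

while True:
    for path in glob.glob(os.path.join(ZIP_DIR, "*.zip")):
        if path in seen:
            continue
        seen.add(path)
        with zipfile.ZipFile(path) as zf:
            for name in zf.namelist():
                if not name.endswith(".csv"):
                    continue
                rows = csv.reader(io.TextIOWrapper(zf.open(name), encoding="utf-8"))
                header = next(rows)
                # Append rows; write the header only when creating the file.
                new_file = not os.path.exists(OUT_CSV)
                with open(OUT_CSV, "a", newline="") as out:
                    w = csv.writer(out)
                    if new_file:
                        w.writerow(header)
                    w.writerows(rows)
    time.sleep(5)
```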
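
The cleaning sketch below shows the session-window grouping in PySpark Structured Streaming, assuming JSON-encoded Kafka messages, a 5-second session gap, the `broker:9092` bootstrap address and two illustrative sensor columns (`acc_x`, `gyro_x`); the real schema, gap and watermark are assumptions. `session_window` is available from Spark 3.2, the version set up above.

```python
# Hypothetical sketch of the spark-cleaning job.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql.types import StructType, StringType, DoubleType, TimestampType

spark = SparkSession.builder.appName("tmda-cleaning").getOrCreate()

# Assumed message layout; the real one has more sensor axes.
schema = (StructType()
          .add("user_id", StringType())
          .add("timestamp", TimestampType())
          .add("acc_x", DoubleType())
          .add("gyro_x", DoubleType()))

raw = (spark.readStream
       .format("kafka")
       .option("kafka.bootstrap.servers", "broker:9092")
       .option("subscribe", "sensors-raw")
       .load()
       .select(F.from_json(F.col("value").cast("string"), schema).alias("m"))
       .select("m.*"))

# mean/min/max/stddev for every sensor column.
aggs = []
for c in ("acc_x", "gyro_x"):
    aggs += [F.mean(c).alias(c + "_mean"), F.min(c).alias(c + "_min"),
             F.max(c).alias(c + "_max"), F.stddev(c).alias(c + "_stddev")]

# One session window per user, closed when no event arrives for 5 seconds.
clean = (raw.withWatermark("timestamp", "10 seconds")
         .groupBy(F.session_window("timestamp", "5 seconds"), "user_id")
         .agg(*aggs))

(clean.select(F.to_json(F.struct("*")).alias("value"))
 .writeStream
 .format("kafka")
 .option("kafka.bootstrap.servers", "broker:9092")
 .option("topic", "sensors")
 .option("checkpointLocation", "/tmp/chk-cleaning")
 .start()
 .awaitTermination())
```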
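
And a streaming sketch for the prediction side, under the same assumptions: the feature column names mirror the cleaning sketch, and writing a streaming DataFrame with `format("es")` requires the elasticsearch-hadoop connector on the Spark classpath.

```python
# Hypothetical sketch of the spark-streaming job.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql.types import StructType, StringType, DoubleType
from pyspark.ml import PipelineModel

spark = SparkSession.builder.appName("tmda-streaming").getOrCreate()
model = PipelineModel.load("spark/model")

# Aggregated features as produced by the cleaning step (illustrative subset).
schema = (StructType()
          .add("user_id", StringType())
          .add("acc_x_mean", DoubleType())
          .add("acc_x_min", DoubleType())
          .add("acc_x_max", DoubleType())
          .add("acc_x_stddev", DoubleType()))

features = (spark.readStream
            .format("kafka")
            .option("kafka.bootstrap.servers", "broker:9092")
            .option("subscribe", "sensors")
            .load()
            .select(F.from_json(F.col("value").cast("string"), schema).alias("m"))
            .select("m.*"))

# Apply the trained pipeline, drop ML vector columns that Elasticsearch
# cannot index, and write the predictions to the sensors index.
(model.transform(features)
 .drop("features", "rawPrediction")
 .writeStream
 .format("es")
 .option("es.nodes", "elasticsearch")
 .option("es.port", "9200")
 .option("checkpointLocation", "/tmp/chk-streaming")
 .start("sensors")
 .awaitTermination())
```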
| Container | URL | Description |
|---|---|---|
| broker | http://localhost:8080 | Kafka UI |
| elasticsearch | http://localhost:9200/ | Elasticsearch |
| elasticsearch | http://localhost:9200/sensors/_search | Elasticsearch index content |
| kibana | http://localhost:5601 | Kibana |
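
To verify that predictions are flowing end to end, the sensors index can also be queried directly from a script; a minimal example, assuming the `requests` package is installed:

```python
import requests

# Fetch a handful of indexed predictions from the sensors index.
resp = requests.get("http://localhost:9200/sensors/_search",
                    params={"size": 5}, timeout=10)
for hit in resp.json()["hits"]["hits"]:
    print(hit["_source"])
```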