Skip to content

Latest commit

 

History

History
17 lines (16 loc) · 1.22 KB

connects.md

File metadata and controls

17 lines (16 loc) · 1.22 KB

Connects docs

  • input-log Read csv files from folder "/inputs/logs/in/". Files with errors will be drop to "/inputs/logs/errors/" folder. Each line will be parsed to avro with InputLog schema (/schema/inputLog.avsc) and create event in input-log-avro topic. Configuration: /connect/config/input-log.json
  • hostname-by-time Read csv files from folder "/inputs/hostnameByTime/in/". Files with errors will be drop to "/inputs/hostnameByTime/errors/" folder. Each line will be parsed to avro with HostnameByTime schema (/schema/HostnameByTime.avsc) and create event in hostname-by-time topic. Configuration: /connect/config/hostnames-by-time.json
  • hostname-agg-by-hour Read csv files from folder "/inputs/hostnameAggByHour/in/". Files with errors will be drop to "/inputs/hostnameAggByHour/errors/" folder. Each line will be parsed to avro with HostnameAggByHour schema (/schema/HostnameAggByHour.avsc) and create event in hostname-aggregations-by-hour topic. Configuration: /connect/config/hostnames-agg-by-hour.json
  • hdfs-sink Read each event from input-log-avro topic and write in hdfs docker in path "/data/input-log/" in parquet format. Configuration: /connect/config/hdfs-link.json