Skip to content

Latest commit

 

History

History
119 lines (87 loc) · 3.64 KB

README.md

File metadata and controls

119 lines (87 loc) · 3.64 KB

My Research Toolkit

rtk processes latency log files and generates a gnuplot template file. It can be used to plot (1). latency CDF, (2). IOPS v.s. time, and (3). lat v.s. time graphs for FIO and replayer latency log files

Dependencies

  • Make sure you have gnuplot, gawk and a pdf reader (e.g., evince) installed in your system.

Log file format

  • must be a CSV file
  • 2nd column represents latency or other types of data to process (FIO log style)

File Structures

.
├── README.md
├── all.sh
├── clear.sh
├── dat
├── eps
├── plot
├── raw
│   └── exptest
│       ├── u7_mongo_def-ro-1_lat.log
│       ├── u7_mongo_def-ro-2_lat.log
│       ├── u7_mongo_def-ro-3_lat.log
│       ├── u7_mongo_def-rw-1_lat.log
│       └── u7_mongo_def-rw-2_lat.log
├── script
│   ├── cdf.awk
│   ├── genplot.sh
│   ├── getstat.sh
│   ├── raw2dat.sh
│   ├── sample-cdf-lat.sh
│   └── stat.awk
└── stat
  • functions of major dirs

    • raw/: For holding raw experiment latency log files. For each set of log files you want to put in the same graph, create an experiment folder (e.g. raw/exptest) for it, then put all your log files under expriment folder

    • dat/: For holding percentile data files, generated from raw log files. For above raw/exptest expriment, rtk will create dat/exptest and put all percentile ".dat" files under it, this folder is the data source later used by gnuplot file to do plotting.

    • stat/: For holding basic statistics info about the latency log file, such as avg, median, etc.

    • plot/: For holding gnuplot template files generated for each expriment dataset.

    • eps/: For holding generated eps graphs for each experiment

    • script/: For holding data processing scripts

How To Use RTK

(1). Create raw/$EXPDIR, e.g. take raw/exptest as an example (raw/exptest already created in this repo)

(2). Put all your .log files into raw/exptest, PLEASE make sure your log file format follows the rules stated above. Under raw/exptest, I already put some sample log files there.

(3). Change all.sh to specify (a) the type of graph you want to have and (b) the experiment folder name. Specifically, if you want to plot CDF for raw/exptest experiment folder, make the following changes in all.sh:

  TARGET="exptest"
  TYPE="lat-cdf"

(4). Run ./all.sh.

It will process log files under raw/exptest, generate percentile files under dat/exptest, create statistics file under stat/exptest, generate gnuplot template file as plot/exptest.plot and plot graph as eps/exptest.eps. By default, rtk will use evince to open the newly generated eps graph.

In this repo, (1)(2)(3) is already done for you with an example experiment named "exptest", you can simply do (4) to see results.

(5). Further tune the graph by changing the parameters in template gnuplot file (e.g., plot/exptest.plot), by adjusting X-axis and Y-axis range, line width and colors, etc. After you modify the plot file, replot it by doing:

$ gnuplot plot/exptest.plot

Then, check out the new eps graph in eps/exptest.eps.

Discard an expriment

Basically, if one experiment is no longer needed, you can delete all related files by using clear.sh, e.g. to delete experiment exptest, do

$ ./clear.sh exptest

This will remove {raw,dat,stat}/exptest, plot/exptest.plot and eps/exptest.eps from your rtk folder.