saving measurements during compile and run time #108

quic-morteza · 2024-09-13T22:47:47Z

For the user to keep track of their measurements, code is added to log the necessary arguments and the measurements into a csv file. E.g. running the following command two times:

python -m QEfficient.cloud.infer --model_name gpt2 --batch_size 4 --prompt_len 64 --ctx_len 1024 --generation_len 512 --mxfp6 --num_cores 16 --device_group [0] --prompt "My name is|My name is|My name is|My name is" --benchmark

generates gpt2_benchmarking.csv

ochougul · 2024-09-16T10:07:04Z

@quic-mamta Please review.

QEfficient/cloud/infer.py

ochougul · 2024-09-16T13:43:22Z

QEfficient/cloud/infer.py

+
+        compile_time = "NA"
+


can you remove this, it's not required. as when benchmark=True, this variable will always exist

This assignment is needed for if the model was pre-compiled, otherwise compile_time will be unassigned. I changed the assignment as compile_time = "pre-compiled" for this case.
Please see https://github.com/quic-morteza/efficient-transformers/blob/8f6abc5a57bd369c4deb1cea8d18725ddc57e2a2/QEfficient/cloud/infer.py#L92

QEfficient/cloud/infer.py

ochougul · 2024-09-16T13:48:36Z

QEfficient/utils/logging_utils.py

+    with open(file, "a", newline="") as csvfile:
+        csvwriter = csv.writer(csvfile)
+        csvwriter.writerow(list(fields.values()))


This will keep on writing to the same CSV, as long as nobody deletes it.
Should we always create new CSV with timestamp and let user decide how to combine those CSVs.

If single CSV keeps updating, it might have too much data and it might be hard to distinguish which data is relevant.

I believe it is needed to accumulate the measurements for the same model in the same CSV file via various runs. Later it is easy for the user to sort through this file and compare against the previous measurements.

QEfficient/cloud/infer.py

quic-morteza · 2024-10-21T21:44:37Z

Thanks for the reviews. Here is how the gpt2_benchmarking.csv look like after running the following command two times:

python -m QEfficient.cloud.infer --model_name gpt2 --batch_size 4 --prompt_len 64 --ctx_len 1024 --mxfp6 --num_cores 16 --prompt "My name is|My name is|My name is|My name is" --benchmark

quic-rishinr · 2024-10-22T06:05:32Z

QEfficient/cloud/infer.py

@@ -35,6 +42,7 @@ def main(
    local_model_dir: Optional[str] = None,
    cache_dir: Optional[str] = None,
    hf_token: Optional[str] = None,
+    benchmark: bool = False,


Can you update the doc string with this flag usage?

Couldn't find. Can you point me to the right doc string path?

you can add it on line number 71

QEfficient/utils/_utils.py

quic-rishinr · 2024-10-22T07:32:51Z

@quic-morteza please resolve the formatting issue and DCO error.

Signed-off-by: quic-morteza <quic_morteza@quicinc.com>

quic-mamta · 2024-10-28T18:39:05Z

QEfficient/utils/_utils.py

+):
+    input_len = max([len(x) for x in tokenizer(prompt, return_tensors="np").input_ids])
+
+    fields = {


Two more fields mos and aic_enable_depth_first can be added here.

quic-mamta · 2024-10-28T18:41:46Z

QEfficient/cloud/infer.py

@@ -106,10 +120,13 @@ def main(
            full_batch_size=full_batch_size,
        )

+        compile_time = (time.perf_counter() - compile_start_time) // 1


This can also be moved under the if condition of benchmark flag, also why keeping it integer?

quic-rishinr · 2024-11-12T11:27:43Z

@quic-morteza Can you address the review comments?

quic-morteza requested review from quic-mamta, anujgupt-github, vbaddi and ochougul as code owners September 13, 2024 22:47

quic-morteza mentioned this pull request Sep 13, 2024

Update infer.py for logging measurements into a csv file #104

Closed

ochougul reviewed Sep 16, 2024

View reviewed changes

QEfficient/cloud/infer.py Outdated Show resolved Hide resolved

ochougul reviewed Sep 16, 2024

View reviewed changes

QEfficient/cloud/infer.py Outdated Show resolved Hide resolved

ochougul reviewed Sep 16, 2024

View reviewed changes

quic-mamta reviewed Sep 16, 2024

View reviewed changes

QEfficient/cloud/infer.py Outdated Show resolved Hide resolved

quic-morteza requested a review from quic-rishinr as a code owner October 21, 2024 19:18

quic-rishinr reviewed Oct 22, 2024

View reviewed changes

quic-rishinr requested a review from quic-mamta October 22, 2024 07:33

quic-morteza force-pushed the mini_bench branch from fd8427f to 961ee25 Compare October 22, 2024 19:31

quic-morteza and others added 3 commits October 22, 2024 13:06

saving measurements during compile and run time

bc7640d

Signed-off-by: quic-morteza <quic_morteza@quicinc.com>

PR reviews addressed

1d0c07f

Signed-off-by: quic-morteza <quic_morteza@quicinc.com>

reviews_2 addressed

26570ce

Signed-off-by: quic-morteza <quic_morteza@quicinc.com>

quic-morteza force-pushed the mini_bench branch from 961ee25 to 26570ce Compare October 22, 2024 20:07

quic-mamta reviewed Oct 28, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

saving measurements during compile and run time #108

saving measurements during compile and run time #108

quic-morteza commented Sep 13, 2024

ochougul commented Sep 16, 2024

ochougul Sep 16, 2024

quic-morteza Oct 21, 2024 •

edited

Loading

ochougul Sep 16, 2024

quic-morteza Oct 21, 2024

quic-morteza commented Oct 21, 2024

quic-rishinr Oct 22, 2024

quic-morteza Oct 22, 2024

quic-rishinr Oct 28, 2024

quic-rishinr commented Oct 22, 2024

quic-mamta Oct 28, 2024

quic-mamta Oct 28, 2024

quic-rishinr commented Nov 12, 2024

saving measurements during compile and run time #108

Are you sure you want to change the base?

saving measurements during compile and run time #108

Conversation

quic-morteza commented Sep 13, 2024

ochougul commented Sep 16, 2024

Choose a reason for hiding this comment

quic-morteza Oct 21, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

quic-morteza commented Oct 21, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

quic-rishinr commented Oct 22, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

quic-rishinr commented Nov 12, 2024

quic-morteza Oct 21, 2024 •

edited

Loading