Refactor Ersilia test command #1224

kurysauce · 2024-08-08T23:04:35Z

Thank you for taking your time to contribute to Ersilia, just a few checks before we proceed

Have you followed the guidelines in our Contribution Guide
Have you written new tests for your core changes, as applicable?
Have you successfully ran tests with your changes locally?

Description

Refactor the Ersilia test command to run in 100% completion for correctly incorporated models. Currently stops prematurely at the check_example_input test method.

Changes to be made

Ensure correct return values and flags are appropriately passed to methods in Model Tester class. Debug other methods as test command runs closer to completion.

EDIT: Completed

Status

Added correct returns and flags to all methods
Fixed path parsing in run_bash method
Corrected original model size calculation implementation by implementing get_directories_sizes() method and associated helper functions.
Updated check_consistent_output method by calculating Spearman's Correlation & Mean Relative Absolute Value thresholds, replacing 5% difference threshold due to false positive failed test cases
Refactored read_csv method to handle differences in column parsing with Ersilia output and Bash output
Specified output file generation + time stamp at completion of a successful test run on model

To do

Must test updated command on multiple models to ensure robustness. Ensure diversified outputs are tested. Test command may reveal hidden flaws from previous model incorporations (non-existent run.sh files, inconsistent outputs comparisons, execution errors in run.sh files that reveal problems in model's main.py script, etc...), open issues for specific models as needed. Seeking help from community members to test models!

Models must be fetched using --from_github or --from_s3 flag, or else test script will fail.

Is this pull request related to any open issue? If yes, replace issueID below with the issue ID

Related to #1203 , #1222, #1221

* WIP Add session management utils * Create a session at ersilia cli initialization * Remove unused code * start an ersilia session in a dedicated sessions dir that is mapped to the parent process' id which ran the given ersilia command, this will generally be a shell process inside a terminal, but it can also be a process from a bash script * declare session specific defaults * Run all ersilia commands within a single process during standard example run, otherwise ersilia run command does not find a served model bec of running in a different process and therefore in a different session * Move the currently served model's pid to its dedicated session directory * WIP Logging * Redirect tmp logs to model's session logs * catch permission error * Redirect tmp logs to model's session logs * don't use ersilia exception management because we don't exactly want the ersilia process to exit

.devcontainer/devcontainer.json

ersilia/io/input.py

ersilia/publish/test.py

kurysauce · 2024-08-12T21:54:00Z

Updated PR @miquelduranfrigola

DhanshreeA · 2024-08-14T06:49:16Z

Hi @kurysauce - I've gone through your refactor of the test command and I think this is very much headed in the right direction. I have a small concern - I am unsure if we are accounting for output types within the check for consistent outputs - the code seems to be written only for dealing with numeric output types, as I don't see any check on the output type anywhere. Is that indeed the case?

DhanshreeA · 2024-08-14T15:34:22Z

@kurysauce I've another request for you for this PR. The previous contributors left a lot of print statements within the code, which is generally not a good practice. Would it be possible for you to replace them with debug and info logs? :)

kurysauce · 2024-08-14T16:05:01Z

@kurysauce I've another request for you for this PR. The previous contributors left a lot of print statements within the code, which is generally not a good practice. Would it be possible for you to replace them with debug and info logs? :)

@DhanshreeA here is the link to the completed Google Doc. Additionally, I replaced the print statements with debug and logging. I don't think I see anything else on my end to update on the test script for now.

kurysauce · 2024-08-14T16:08:55Z

Hi @kurysauce - I've gone through your refactor of the test command and I think this is very much headed in the right direction. I have a small concern - I am unsure if we are accounting for output types within the check for consistent outputs - the code seems to be written only for dealing with numeric output types, as I don't see any check on the output type anywhere. Is that indeed the case?

@DhanshreeA I believe the test commands tests string as well in these spots: 1 and 2

DhanshreeA · 2024-08-15T07:07:51Z

@kurysauce I've another request for you for this PR. The previous contributors left a lot of print statements within the code, which is generally not a good practice. Would it be possible for you to replace them with debug and info logs? :)

Sounds good @DhanshreeA ! Just to clarify, what would be appropriate to print vs log? (outputs from failed runs, output from successful runs, indication of new test check, etc...)?

Generally speaking, there should be no print statements and everything that needs to be logged onto the terminal should essentially be done through a logger and not through print. :)

miquelduranfrigola · 2024-08-19T12:23:09Z

Yes indeed, generally speaking we want to log information instead of printing. Note that logging is activated only with the -v flag, though. So, to show on screen print-like messages, we can use the echo command from the click library as done by the ersilia CLI.

In any case, there are plenty of examples within the Ersilia codebase as to how we should be logging messages. Usually, they are accessible via the self.log.debug/info/warning methods if ErsiliaBase is used as a parent class.

I hope this helps!

DhanshreeA and others added 30 commits July 26, 2024 17:30

copy relevant files

d3430f4

read from 3rd column, logging statements bash output file

ab2ac7e

run_bash path edit, conda.sh current failed fix

a9c3ded

add new size methods, clean directory size method

116f91c

add additional logging of file types and count

a6b64e8

delete dead function

279c188

fix run path

f00bd7b

uncomment testing

c06cff5

fix bash output file opening (arg1)

258d0c4

change var names for clarity and add debug prints

2ef94a5

addtional logging statements + updated read_csv method

97281d6

fixed updated_read_csv

72a23b1

Delete comments and dead code

73e1b80

clean up read_csv

5cb00eb

Merge branch 'master' of https://github.com/kurysauce/ersilia_test

ed982ad

fix comparison method

399c638

add prints

2873382

model size calculation update

e243cb8

delete comments, update state variables

9c33c19

add time stamp

712b7ec

add debugging print for thresholds

295792b

update threshold debugging statements

20134a4

implement MRAE + Spearman's

26c91cd

bash output file path log

74a1058

fix divide by 0 mrae calculation

3af3a63

Update threshold

1654d43

bash script logging statements

2a2d462

Merge branch 'master' of https://github.com/kurysauce/ersilia_test

f25b29d

bold message debug

2fd1bd8

kurysauce and others added 10 commits August 8, 2024 22:43

generalize read_csv for different column numbers

2b57c0e

uncomment

98a30cc

specify output

df76dae

include scipy dependecy in pyproject.toml

770d351

delete dead function + comment

a453f8b

Merge branch 'master' into master

cef9a22

delete unused var

6024c8b

Merge branch 'master' of https://github.com/kurysauce/ersilia_test

19822d0

fairyfloss bckgrnd

20d903f

clarity code mrae calculation

758e249

DhanshreeA requested changes Aug 12, 2024

View reviewed changes

kurysauce added 2 commits August 12, 2024 20:53

adjust control flow example method

5290025

update metric calculations, delete comments

458dbec

kurysauce requested a review from DhanshreeA August 12, 2024 21:53

Update comment wording

1beb755

kurysauce added 3 commits August 14, 2024 21:48

remove prints and add log statements

5e284a4

Merge branch 'master' of https://github.com/kurysauce/ersilia_test

7ee783f

end test command statement

0e91dc5

fix log format + delete echo and prints

1253853

Merge branch 'master' into master

6ed028a

DhanshreeA merged commit 2a0ba90 into ersilia-os:master Aug 19, 2024
16 checks passed

kurysauce mentioned this pull request Aug 19, 2024

🐛 Bug: Ersilia Test Command: False Positive Test Failure #1203

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor Ersilia test command #1224

Refactor Ersilia test command #1224

kurysauce commented Aug 8, 2024 •

edited

Loading

kurysauce commented Aug 12, 2024

DhanshreeA commented Aug 14, 2024

DhanshreeA commented Aug 14, 2024

kurysauce commented Aug 14, 2024 •

edited

Loading

kurysauce commented Aug 14, 2024 •

edited

Loading

DhanshreeA commented Aug 15, 2024

miquelduranfrigola commented Aug 19, 2024

Refactor Ersilia test command #1224

Refactor Ersilia test command #1224

Conversation

kurysauce commented Aug 8, 2024 • edited Loading

kurysauce commented Aug 12, 2024

DhanshreeA commented Aug 14, 2024

DhanshreeA commented Aug 14, 2024

kurysauce commented Aug 14, 2024 • edited Loading

kurysauce commented Aug 14, 2024 • edited Loading

DhanshreeA commented Aug 15, 2024

miquelduranfrigola commented Aug 19, 2024

kurysauce commented Aug 8, 2024 •

edited

Loading

kurysauce commented Aug 14, 2024 •

edited

Loading

kurysauce commented Aug 14, 2024 •

edited

Loading