Considering the case empty request list is given to base model #250

sadra-barikbin · 2024-08-02T18:13:32Z

Hi there!

This PR handles the case in which an empty request list is given to the base model. This is needed at least because LightevalTask.construct_requests might produce a dict with empty entries.
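To illustrate the situation, here is a minimal sketch, not the code changed in this PR. It assumes requests are grouped by type in a dict and that some groups can be empty. The names `run_requests`, `split_into_batches`, and `process_batch` are hypothetical and do not come from lighteval.

```python
from typing import Callable, Dict, List


def split_into_batches(items: List[str], batch_size: int) -> List[List[str]]:
    """Split items into consecutive batches; an empty list yields no batches."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return [items[i : i + batch_size] for i in range(0, len(items), batch_size)]


def run_requests(
    requests_by_type: Dict[str, List[str]],
    process_batch: Callable[[List[str]], List[str]],
    batch_size: int = 2,
) -> Dict[str, List[str]]:
    """Process each request group, returning an empty result for empty groups."""
    results: Dict[str, List[str]] = {}
    for request_type, requests in requests_by_type.items():
        if not requests:
            # Hypothetical guard: skip the model call entirely for an empty group.
            results[request_type] = []
            continue
        outputs: List[str] = []
        for batch in split_into_batches(requests, batch_size):
            outputs.extend(process_batch(batch))
        results[request_type] = outputs
    return results
```

With a guard like this, a group with no requests produces an empty result list instead of reaching code that expects at least one item.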

sadra-barikbin · 2024-08-31T08:21:24Z

@clefourrier @NathanHB
There seems to be something wrong with nanotron 0.4.0 or with how we import things from it. Merely running

from nanotron.logging import get_logger

located in hierarchical_logger.py, raises an import error:

from lighteval.logging.hierarchical_logger import hlog
src/lighteval/logging/hierarchical_logger.py:33: in <module>
    from nanotron.logging import get_logger
../../.local/lib/python3.10/site-packages/nanotron/logging.py:38: in <module>
    from nanotron.config.config import LoggingArgs
../../.local/lib/python3.10/site-packages/nanotron/config/__init__.py:2: in <module>
    from nanotron.config.config import *
../../.local/lib/python3.10/site-packages/nanotron/config/config.py:13: in <module>
    from nanotron.config.lighteval_config import LightEvalConfig
../../.local/lib/python3.10/site-packages/nanotron/config/lighteval_config.py:5: in <module>
    from nanotron.config.parallelism_config import ParallelismArgs
../../.local/lib/python3.10/site-packages/nanotron/config/parallelism_config.py:4: in <module>
    from nanotron.config.utils_config import (
../../.local/lib/python3.10/site-packages/nanotron/config/utils_config.py:8: in <module>
    from nanotron.parallel.pipeline_parallel.engine import (
../../.local/lib/python3.10/site-packages/nanotron/parallel/pipeline_parallel/__init__.py:1: in <module>
    from nanotron.parallel.pipeline_parallel.engine import PipelineEngine
../../.local/lib/python3.10/site-packages/nanotron/parallel/pipeline_parallel/engine.py:8: in <module>
    from nanotron.logging import log_rank
E   ImportError: cannot import name 'log_rank' from partially initialized module 'nanotron.logging' (most likely due to a circular import) (/home/sadrodin/.local/lib/python3.10/site-packages/nanotron/logging.py)
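One way to keep a module importable while the upstream package is broken is a guarded import. The sketch below is only an illustration and is not what lighteval does. Falling back to the standard library's `logging.getLogger` is an assumption made here.

```python
import logging

try:
    # This is the import shown in the traceback above.
    from nanotron.logging import get_logger
except ImportError:
    # Assumed fallback for illustration: use the standard library logger
    # when nanotron is missing or fails with a circular import.
    get_logger = logging.getLogger

logger = get_logger(__name__)
```

A circular-import failure is raised as `ImportError`, so the same `except` clause covers both a missing package and the error reported here.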

NathanHB · 2024-09-02T11:44:19Z

Hi! Yes, this is an issue on nanotron's side. They have been made aware.

sadra-barikbin · 2024-09-03T16:43:03Z

@NathanHB, a test raises a permission error when the EnvConfig.cache param passed to AutoConfig.from_pretrained() (while loading BaseModel) has its default value, /scratch. Was this path chosen on purpose?

sadra-barikbin · 2024-10-09T19:27:13Z

@NathanHB, wouldn't it be better for the default value of EnvConfig's cache to be huggingface_hub.constants.HF_HUB_CACHE?
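A minimal sketch of what such a default could look like. `EnvConfigSketch` and `default_hub_cache` are hypothetical names for illustration, and only the cache field is modeled. The home-directory fallback is an assumption for environments where huggingface_hub is not installed.

```python
import os
from dataclasses import dataclass, field


def default_hub_cache() -> str:
    """Return the Hugging Face hub cache path, or a home-directory fallback."""
    try:
        from huggingface_hub.constants import HF_HUB_CACHE

        return str(HF_HUB_CACHE)
    except ImportError:
        # Assumed fallback path, used only when huggingface_hub is unavailable.
        return os.path.join(os.path.expanduser("~"), ".cache", "huggingface", "hub")


@dataclass
class EnvConfigSketch:
    """Hypothetical stand-in for EnvConfig; only the cache field is modeled."""

    cache: str = field(default_factory=default_hub_cache)
```

Because the default is resolved under the user's home directory rather than at /scratch, it is less likely to hit a permission error on machines where /scratch is absent or not writable. An explicit `cache=...` argument still overrides it.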

sadra-barikbin added 2 commits August 2, 2024 21:38

Do the changes

d9fdb24

Fix a tiny bug

2c6be17

clefourrier reviewed Aug 8, 2024

src/lighteval/data.py (outdated, resolved)

tests/test_base_model.py (outdated, resolved)

sadra-barikbin and others added 3 commits August 13, 2024 00:55

Apply review comment

d9df723

Merge branch 'main' into fix-empty-input-to-base-model

e3c0ab9

Apply review

1d47a92

sadra-barikbin requested a review from clefourrier August 17, 2024 16:40

Merge branch 'main' into fix-empty-input-to-base-model

72010cc

NathanHB reviewed Aug 19, 2024

src/lighteval/data.py (outdated, resolved)

NathanHB and others added 3 commits August 19, 2024 13:34

Merge branch 'main' into fix-empty-input-to-base-model

c5719a6

Update src/lighteval/data.py

ab168cc

Co-authored-by: Nathan Habib <30601243+NathanHB@users.noreply.github.com>

Merge branch 'main' into fix-empty-input-to-base-model

b84499e

NathanHB approved these changes Aug 28, 2024

Update the test

e0a2431

sadra-barikbin and others added 3 commits September 2, 2024 22:17

Merge branch 'main' into fix-empty-input-to-base-model

934c5e6

Fix formatting

4a22465

Merge branch 'main' into fix-empty-input-to-base-model

50c9b15

sadra-barikbin and others added 4 commits September 4, 2024 14:44

Merge branch 'main' into fix-empty-input-to-base-model

35c2453

Update test

6cbfed9

Merge branch 'main' into fix-empty-input-to-base-model

53aac5d

Merge branch 'main' into fix-empty-input-to-base-model

a01e09e

sadra-barikbin added 2 commits October 9, 2024 22:59

Change env_config cache param in the test

0a9686e

Merge remote-tracking branch 'refs/remotes/upstream/fix-empty-input-t…

e4e6ffa

…o-base-model' into fix-empty-input-to-base-model

sadra-barikbin requested a review from NathanHB October 9, 2024 19:31

Merge branch 'main' into fix-empty-input-to-base-model

e257743
