Skip to content

v0.4.0

Compare
Choose a tag to compare
@NathanHB NathanHB released this 05 Sep 13:28
· 53 commits to main since this release

What's new

Features

New tasks

Fixes

Significant community contributions

The following contributors have made significant changes to the library over the last release:

  • @NathanHB
    • commit (#137)
    • Add llm as judge in metrics (#146)
    • Nathan add logging to metrics (#157)
    • add 'cite as' section in readme (#178)
    • Fix citation section in readme (#180)
    • adding aimo custom eval (#154)
    • fix llm as judge warnings (#173)
    • launch lighteval using lighteval --args (#152)
    • adds llm as judge using transformers (#223)
    • Fix missing json file (#264)
    • change priority when choosing model dtype (#263)
    • fix the location of tasks list in the readme (#267)
    • updates ifeval repo (#268)
    • fix nanotron (#283)
    • add vlmm backend (#274)
    • bump nltk version (#290)
  • @clefourrier
    • Add config files for models (#131)
    • Add fun widgets to the README (#145)
    • Fix nanotron models input size bug (#156)
    • no function we actually use should be named prompt_fn (#168)
    • Add maj@k metric (#158)
    • Homogeneize logging system (#150)
    • Use only dataclasses for task init (#212)
    • Now only uses functions for prompt definition (#213)
    • Data split depending on eval params (#169)
    • should fix most inference endpoints issues of version config (#226)
    • Add metrics as functions (#214)
    • Quantization related issues (#224)
    • Update issue templates (#235)
    • remove latex writer since we don't use it (#231)
    • Removes default bert scorer init (#234)
    • fix (#233)
    • udpated piqa (#222)
    • uses torch compile if provided (#248)
    • Fix inference endpoint config (#244)
    • Expose samples via the CLI (#228)
    • Fixing issues with multichoice_continuations_start_space - was not parsed properly (#232)
    • Programmatic interface + cleaner management of requests (#269)
    • Small file reorg (only renames/moves) (#271)
    • Refactoring the few shot management (#272)
  • @PhilipMay
    • Add Ger-RAG-evaltasks. (#149)
    • Add version config option. (#181)
  • @shaltielshmid
    • Added Namespace parameter for InferenceEndpoints, added option for passing model config directly (#147)
    • Updated tgi_model and added parameters for endpoint_model (#208)
  • @hynky1999
    • make info loggers dataclass, so that their properties have expected lifetime (#280)
    • Remove expensive prediction run during test collection (#279)
    • Probability Metric + New Normalization (#276)
    • Standalone nanotron config (#285)
    • Logging Revamp (#284)