FCDiffScorer

Proposed change

This analysis aims to correct the difficulty rating given for Free Contest problems on VNOJ.

The first iteration of this analysis proposes a change to the difficulty scoring of 332 problems that have a text-based ranking board on the Free Contest database before 2022.

Compared to the existing VNOJ scoring, the average change is -0.08 with a std deviation of 0.33.

Relative difficulty curve

Assuming each problem has a difficulty rating of R in [0.01, 1.00]. If no one solves a problem, R defaults to 1.00, while if everyone solves a problem, R defaults to 0.01. This later will be scaled by the difficulty multiplier for each type of contest:

For Beginner Free Contest: multiplier = 0.75
For IOI and TST practice contests: multiplier = 2.00
For other contests: multiplier = 1.50

Let P be the in-contest AC rate, calculated by dividing the in-contest AC count (ac) by the number of true participants (tp).
Inspired by the modified Elo Rating system used on Codeforces for problemset difficulty, for every 0.50 points increase in the problem difficulty (scale from 0.01 to 1.00), P is divided by 10. Let f(P) be this initial function:

f(x) = (-\log_{10}(x*0.99+0.01)/2*0.99+0.01)

This function works well for easy problems, but for more difficult the reward is too low. A small buff is needed for harder problems with AC rate < 0.5

g(x) = f(x)+(1-f(x))*(\max(0,0.5-x)^{2})

I intend to reward a >0.9 difficulty for problems with a lower than 1% AC rate. Another buff is needed for such extremely difficult problems.

h(x) = g(x)+(1-g(x))*((\max(0,0.125-x)*6)^{3})

Desmos demostration: https://www.desmos.com/calculator/ahstqyorda

Calculation of True Participant count (`tp`)

One approach is to take the count of non-zero scoring participants as the True Participant count. However, there exist old contests (such as Free Contest 40), where it was found that 70% of participants got zero scores. One task in this contest was found to be solved by all non-zero-scoring participants. Reading that problem statement, it definitely was not an easy problem considering the contest only had 4 problems. By the old approach, this task would have a 100% AC rate and having difficulty of 0.01. Many other contests face the same problem, where all tasks are too hard that more than half of registrants get 0 score.

One analysis has found that the zero-scoring ratio per contest is avg .43, std deviation of .15. Therefore, the bottom 15% of the ranking board shall be removed while keeping other scores as part of actual participants. I propose the new calculation of tp as .85 * (all participants count), regardless of zero contest scores.

About all the files

For VNOJ admin, the final suggested change is given in analysis_final.csv. The problem codename is in the Task column, its new rating is in the JRatingM column. Please run another check on whether the Task codename exists on VNOJ, as this table is calculated based on the Free Contest database.

To read the detailed final work, consult the analysis_final.xlsx file. Other results are located in the analysis subdirectory.

Running this notebook

To run this Jupyter notebook (and come up with the final result yourself):

Install all requirements as in requirements.txt.
Create a service account credential from Google as a JSON file, with Google Drive API enabled.
Download the notebook, place the credential file in the same folder, and run the notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
__deprecated		__deprecated
analysis		analysis
analysis_v2		analysis_v2
.gitignore		.gitignore
DATABASE.xlsx - KT.csv		DATABASE.xlsx - KT.csv
DATABASE.xlsx - TLKT.csv		DATABASE.xlsx - TLKT.csv
README.md		README.md
notebook.ipynb		notebook.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FCDiffScorer

Proposed change

Relative difficulty curve

Calculation of True Participant count (`tp`)

About all the files

Running this notebook

About

Releases

Packages

Languages

snowynguyen/FCDiffScorer

Folders and files

Latest commit

History

Repository files navigation

FCDiffScorer

Proposed change

Relative difficulty curve

Calculation of True Participant count (tp)

About all the files

Running this notebook

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Calculation of True Participant count (`tp`)

Packages