Skip to content

Releases: lucazav/binclass-tools

Binclass-tools 1.1.2

14 Mar 15:36
Compare
Choose a tag to compare

Updates:

  • fixed plotly requirement in setup.py

Binclass-tools 1.1.1

07 Mar 16:37
Compare
Choose a tag to compare

Updates:

  • lift_curve_plot: xaxis range fixed

Binclass-tools 1.1.0

14 Mar 15:41
Compare
Choose a tag to compare

Release 1.1.0 of package Binclass-tools

Updates:

  • interactive confusion matrix: from version 1.1.0 the optimal thresholds dataframe returned in the confusion matrix plot will refer to the thresholds that give the best value of the implemented metrics (or the minimal Cost) for the given set of data, instead of the optimal threshold computed with GHOST method.
    The following parameters have been removed:

    • optimize_threshold,
    • N_subsets,
    • subsets_size,
    • with_replacement
  • get_ghost_optimal_thresholds_df: new name of function get_optimized_thresholds_df, behaviour remains the same except for parameter optimize_threshold for which "ROC" isn't supported anymore (was redundant, use Fscore instead)

  • get_ghost_optimal_threshold: new name of function get_optimal_threshold, behaviour remains the same except for parameter ThOpt_metrics for which "ROC" isn't supported anymore (was redundant, use Fscore instead)

  • get_ghost_optimal_cost: new name of function get_cost_optimal_threshold, behaviour remains the same

  • bug in interactive confusion matrix fixed

Binclass-tools 1.0.0

01 Mar 09:14
Compare
Choose a tag to compare

1.0.0 release of package Binclass-tools

Features multiple new graphic tools:

  • calibration_curve_plot: returns the calibration curve, also known as reliability diagram: plots the average predicted probability for each bin against the fraction of positive classes using true labels and predicted probabilities, and shows the error for each bin and the
    value of the Expected Calibration Error (ECE)

  • calibration_plot_from_models: allows to visually compare the calibration curves of different models and returns the ECE for each given model

  • cumulative_gain_plot: shows the percentage of targets reached when considering a chosen percentage of the population with the highest predicted probability of belonging to the target class

  • lift_curve_plot: shows the effectiveness of the model by computing the ratio between the result obtained with a model and the result that would be obtained by a random classifier

  • response_curve_plot: plots the percentage of actual target class records per decile, where the first decile is associated with the 10 percent of observation with the highest predicted probability of belonging to the target class.

  • cumulative_response_plot: plots the percentage of actual target class records per decile cumulatively

New utility function added:

  • get_expected_calibration_error: returns the ECE for the given model's predicted probabilities, with custom bin parameter

Updates for every graphical function :

  • From version 1.0.0, the behavior of functions that generate plots has changed: generated plots won't be shown directly when the function is called, the related Figure (Plotly) objects, dictionary-like, will be returned as the first outputs instead.

Binclass-tools 0.3.0

09 Sep 13:44
Compare
Choose a tag to compare

Release 0.3.0 of package Binclass-tools.

Features a new graphic tool:

  • predicted_proba_density_curve_plot: (interactive Plotly plot with slider) displays either the kernel density estimation curve (default behavior) or the normal distribution curve of the predicted probabilities, grouped by the relative true class (the type of curve can be chosen with the curve_type parameter). For each threshold, the regions that are correctly or incorrectly classified will be visualized with different colors.

Updates:

  • Every graphical function: added parameter show_display_modebar (default True) to choose whether to display the Plotly bar mode or not

  • curve_PR_plot and curve_ROC_plot: small graphical updates

  • confusion_linechart_plot:

    • bug fixed, now total_amount returned is None when amount parameter is None
    • small graphical updates

Binclass-tools 0.2.4

30 Aug 12:54
Compare
Choose a tag to compare

0.2.4 release of package Binclass-tools.

Updates:

  • setup requirements: added nbformat >= 4.2.0 to requirements for compatibility with the Plotly library.

  • compatibility with python versions 3.9 and 3.10

  • curve_PR_plot: added F-beta score to hover info of the plot.

Binclass-tools 0.2.2

19 Aug 20:23
Compare
Choose a tag to compare

0.2.2 release of package Binclass-tools.

Features three new graphic tools:

  • curve_ROC_plot: plots the Receiver Operating Characteristic (ROC) Curve with Plotly and returns the value of the area under the curve.
  • curve_PR_plot: plots the Precision-Recall (PR) Curve with iso-Fbeta curves (representing all points in the precision-recall space whose F-beta scores are equal) with Plotly. The function allows us to choose the value of the beta parameter and displays ISO curves associated with F-beta score values of 0.2, 0.4, 0.6 and 0.8. Returns, as in the ROC curve case, the value of the area under the curve:
  • predicted_proba_violin_plot: (interactive Plotly plot with slider) displays, through violin plots, the distribution of the predicted probabilities grouped by the relative true class. For each threshold, it allows to see whether the predicted probability for each data point generates a correct prediction or not.

Binclass-tools 0.1.7

19 Jul 14:20
Compare
Choose a tag to compare

First release of package Binclass-tools.

Features:

  • Powerful interactive charts that simplify the analysis of a binary classifier's performance, including any amounts and costs associated with individual observations.

  • A set of functions that return the values of metrics useful for measuring the performance of a binary classifier, for each threshold value if dependent on it.

  • A set of functions to find the optimal threshold value calculated on both the most popular metrics associated with the binary classifier under analysis, and any costs associated with each of the 4 categories in the confusion matrix.

  • A set of generic wrappers that help the analyst in daily operations dealing with binary classifications.