selected_features for learners that don't support it should be the entirety of features seen in training #935

mb706 · 2023-06-29T19:06:23Z

This way we could correctly query a pipeline that selects features first and gives the result to a learner. The GraphLearner could then ask the learner at the end how many features it used, and if it is a learner that supports embedded featsel (rpart e.g.) then this would give the correct value, but even for learners that do not do support it the result could make sense.

Also this would solve mlr-org/mlr3fselect#87

be-marc · 2024-08-17T12:15:31Z

library(mlr3)
library(mlr3learners)

learner = lrn("classif.rpart")
task = tsk("spam")

learner$train(task)
learner$selected_features()

#> [1] "charDollar"      "hp"             
#> [3] "remove"          "charExclamation"
#> [5] "capitalTotal"    "free"  

learner = lrn("classif.log_reg")
learner$train(task)
learner$selected_features()
# > Error: attempt to apply non-function

berndbischl · 2024-12-19T10:05:03Z

so first order of business would be here to extend the docs, the docs don't say what happens if the property does not exists

mb706 · 2024-12-19T10:08:56Z

currently mlr3pipelines handles this on its own end if this flag is set:

https://github.com/mlr-org/mlr3pipelines/blob/b1042d7967d13207276f6c1e429dfca86c76416f/R/GraphLearner.R#L73-L84

berndbischl · 2024-12-19T10:14:57Z

i would suggest this
a) the learner has an member var / option:
selected_features_not_supported = "error" / "all"
this controls what happens in "selected_features"

b) selected_features as a method is present in all learners.
by default it return an error (if not implemented)

mb706 · 2024-12-19T10:17:58Z

selected_features_impute

berndbischl · 2024-12-19T10:19:42Z

we also need to remove this from pipelines then

be-marc · 2024-12-20T14:48:44Z

Closed by #1230

berndbischl added the Workshop label Aug 16, 2024

berndbischl self-assigned this Aug 17, 2024

berndbischl assigned be-marc and unassigned berndbischl Dec 19, 2024

be-marc closed this as completed Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

selected_features for learners that don't support it should be the entirety of features seen in training #935

selected_features for learners that don't support it should be the entirety of features seen in training #935

mb706 commented Jun 29, 2023

be-marc commented Aug 17, 2024

berndbischl commented Dec 19, 2024

mb706 commented Dec 19, 2024

berndbischl commented Dec 19, 2024

mb706 commented Dec 19, 2024

berndbischl commented Dec 19, 2024

be-marc commented Dec 20, 2024

selected_features for learners that don't support it should be the entirety of features seen in training #935

selected_features for learners that don't support it should be the entirety of features seen in training #935

Comments

mb706 commented Jun 29, 2023

be-marc commented Aug 17, 2024

berndbischl commented Dec 19, 2024

mb706 commented Dec 19, 2024

berndbischl commented Dec 19, 2024

mb706 commented Dec 19, 2024

berndbischl commented Dec 19, 2024

be-marc commented Dec 20, 2024