Regular expression denial of service in apache tika
Moderate severity
GitHub Reviewed
Published
May 17, 2022
to the GitHub Advisory Database
•
Updated Aug 24, 2023
Package
Affected versions
>= 2.0.0, < 2.4.0
>= 1.17, < 1.28.2
Patched versions
2.4.0
1.28.2
Description
Published by the National Vulnerability Database
May 16, 2022
Published to the GitHub Advisory Database
May 17, 2022
Reviewed
May 25, 2022
Last updated
Aug 24, 2023
In Apache Tika, a regular expression in our StandardsText class, used by the StandardsExtractingContentHandler could lead to a denial of service caused by backtracking on a specially crafted file. This only affects users who are running the StandardsExtractingContentHandler, which is a non-standard handler.
This was originally fixed in 1.28.2 and 2.4.0. While the fix in version 2.4.0 was complete, the fix for the 1.x branch wasn't incorporated until version 1.28.3. Please see GHSA-qw3f-w4pf-jh5f for more information.
References