refactor: Refactor inpage blocklist to avoid usage of regex #8675

NicholasEllul · 2024-02-22T16:48:09Z

Description

This PR fixes two bugs that occured as the result of using regex to identify URLs in our content script blocklist.

The first issue is that we were only escaping the first . found in a URL when using the inpage blocklist. This meant that entries such as ani.gamer.com.tw would have their first period escaped for regex parsing, but subsequent periods were treated as regex wildcards. This could lead to and unintentionally matching on URLs such as ani.gamerxcom.tw etc.

The second issue is that we were missing a leading anchor ^ in the regex expression. This means that we would block the domain if the matched string occurred anywhere in the URL. For an example, https://google.com?search=uscourts.gov would be a blocked domain since it ended in uscourts.gov. Adding the leading anchor addresses this so we only match the correct domain.

To avoid future regex complexities, this code has been refactored to use built in javascript URL parsing instead.

Related issues

https://github.com/MetaMask/mobile-planning/issues/1571

Manual testing steps

Pre-merge author checklist

Pre-merge reviewer checklist

I've manually tested the PR (e.g. pull and build branch, run the app, test code being changed).
I confirm that this PR addresses all acceptance criteria described in the ticket it closes and includes the necessary testing evidence such as recordings and or screenshots.

Note: Issue with testing has been created here: #9009

github-actions · 2024-02-22T16:48:20Z

CLA Signature Action: All authors have signed the CLA. You may need to manually re-run the blocking PR check if it doesn't pass in a few minutes.

github-actions · 2024-02-22T17:01:23Z

E2E test started on Bitrise: https://app.bitrise.io/app/be69d4368ee7e86d/pipelines/63f2cd35-fd45-4f0e-b96d-5bbb17b85e18
You can also kick off another Bitrise E2E smoke test by removing and re-applying the (Run Smoke E2E) label

tommasini

LGTM!

scripts/inpage-bridge/content-script/index.js

NicholasEllul · 2024-02-22T17:21:54Z

@tommasini also I struggled to mock out window in my jest tests, if you have any suggestions on how I could export this function & test it, please let me know

davidmurdoch

Tested by copying the new function from here, visiting websites in the browser, and pasting the function and a call to blockedDomainCheck() into the console.

It seems to work very well!

NicholasEllul · 2024-03-06T15:23:17Z

I've updated this to match the latest changes found on MetaMask/metamask-extension#23134

github-actions · 2024-03-06T15:38:47Z

Bitrise

🔄🔄🔄 pr_smoke_e2e_pipeline started on Bitrise...🔄🔄🔄

Commit hash: 659f495
Build link: https://app.bitrise.io/app/be69d4368ee7e86d/pipelines/0da6d2eb-0b2a-49fd-ae06-68c6d7e2e046

Note

This comment will auto-update when build completes
You can kick off another pr_smoke_e2e_pipeline on Bitrise by removing and re-applying the Run Smoke E2E label on the pull request

github-actions · 2024-03-07T19:17:14Z

Bitrise

✅✅✅ pr_smoke_e2e_pipeline passed on Bitrise! ✅✅✅

Commit hash: 19b4d5c
Build link: https://app.bitrise.io/app/be69d4368ee7e86d/pipelines/16d196b1-482c-450a-88e4-7094c1bbdb1c

Note

You can kick off another pr_smoke_e2e_pipeline on Bitrise by removing and re-applying the Run Smoke E2E label on the pull request

sonarcloud · 2024-03-07T19:25:30Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
No data about Duplication

See analysis details on SonarCloud

codecov-commenter · 2024-03-07T19:25:54Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 43.23%. Comparing base (92c9521) to head (19b4d5c).

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #8675   +/-   ##
=======================================
  Coverage   43.23%   43.23%           
=======================================
  Files        1271     1271           
  Lines       30905    30905           
  Branches     3088     3088           
=======================================
  Hits        13361    13361           
  Misses      16769    16769           
  Partials      775      775

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

NicolasMassart

Could you update the tests?
As you provided an example of dangerous url that could pass with the regex, to confirm the issue is solved, you could add a test case in the unit tests matching against this ani.gamerxcom.tw type of URL.

Same for the second case with https://google.com?search=uscourts.gov

NicholasEllul · 2024-03-20T14:14:09Z

@NicolasMassart I tried to get some kind of unit test set up and running but struggled trying to figure out how to mock out some of the browser window functions with the current setup. Unlike in extension (https://github.com/MetaMask/metamask-extension/pull/23134/files), the code in this file gets executed the moment it gets imported resulting in difficulties getting a suite of multiple tests running correctly.

NicolasMassart

@NicolasMassart I tried to get some kind of unit test set up and running but struggled trying to figure out how to mock out some of the browser window functions with the current setup. Unlike in extension (https://github.com/MetaMask/metamask-extension/pull/23134/files), the code in this file gets executed the moment it gets imported resulting in difficulties getting a suite of multiple tests running correctly.

As discussed, we don't have any ways to test this right now but we are aware it should be.
So creating an issue to investigate testing this injected script is a good start. (please post the link in this PR).

NicholasEllul · 2024-03-20T16:11:16Z

Related issue has been created here: #9009

NicholasEllul mentioned this pull request Feb 22, 2024

fix: Address issues with inaccurate regex expressions #8661

Closed

13 tasks

NicholasEllul changed the title ~~Refactor content-script blocklist to avoid usage of regex~~ refactor: Refactor content-script blocklist to avoid usage of regex Feb 22, 2024

metamaskbot added the INVALID-PR-TEMPLATE PR's body doesn't match template label Feb 22, 2024

NicholasEllul added the team-mobile-platform label Feb 22, 2024

NicholasEllul marked this pull request as ready for review February 22, 2024 16:59

NicholasEllul requested a review from a team as a code owner February 22, 2024 16:59

github-actions bot added the Run Smoke E2E Triggers smoke e2e on Bitrise label Feb 22, 2024

NicholasEllul changed the title ~~refactor: Refactor content-script blocklist to avoid usage of regex~~ refactor: Refactor inpage blocklist to avoid usage of regex Feb 22, 2024

tommasini previously approved these changes Feb 22, 2024

View reviewed changes

scripts/inpage-bridge/content-script/index.js Outdated Show resolved Hide resolved

Gudahtt reviewed Feb 22, 2024

View reviewed changes

scripts/inpage-bridge/content-script/index.js Outdated Show resolved Hide resolved

NicholasEllul dismissed tommasini’s stale review via a1c30f0 February 22, 2024 17:21

NicholasEllul requested review from tommasini and Gudahtt February 22, 2024 17:33

davidmurdoch previously approved these changes Feb 22, 2024

View reviewed changes

NicholasEllul requested a review from a team February 22, 2024 20:53

NicholasEllul dismissed davidmurdoch’s stale review via 0ee0612 March 6, 2024 15:21

Refactor content-script blocklist to avoid usage of regex

659f495

NicholasEllul force-pushed the ellul/remove-blocklist-regex branch from 0ee0612 to 659f495 Compare March 6, 2024 15:22

NicholasEllul added the needs-dev-review PR needs reviews from other engineers (in order to receive required approvals) label Mar 6, 2024

NicholasEllul added Run Smoke E2E Triggers smoke e2e on Bitrise and removed Run Smoke E2E Triggers smoke e2e on Bitrise labels Mar 6, 2024

davidmurdoch approved these changes Mar 6, 2024

View reviewed changes

Merge branch 'main' into ellul/remove-blocklist-regex

19b4d5c

NicholasEllul added Run Smoke E2E Triggers smoke e2e on Bitrise and removed Run Smoke E2E Triggers smoke e2e on Bitrise labels Mar 7, 2024

NicolasMassart reviewed Mar 20, 2024

View reviewed changes

NicolasMassart approved these changes Mar 20, 2024

View reviewed changes

NicholasEllul mentioned this pull request Mar 20, 2024

content-script lacks tests required to ensure stability and accuracy #9009

Open

9 tasks

NicholasEllul merged commit 5bf8452 into main Mar 20, 2024
33 of 36 checks passed

NicholasEllul deleted the ellul/remove-blocklist-regex branch March 20, 2024 16:13

github-actions bot locked and limited conversation to collaborators Mar 20, 2024

github-actions bot removed the needs-dev-review PR needs reviews from other engineers (in order to receive required approvals) label Mar 20, 2024

metamaskbot added the release-7.20.0 Issue or pull request that will be included in release 7.20.0 label Mar 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: Refactor inpage blocklist to avoid usage of regex #8675

refactor: Refactor inpage blocklist to avoid usage of regex #8675

NicholasEllul commented Feb 22, 2024 •

edited

Loading

github-actions bot commented Feb 22, 2024

github-actions bot commented Feb 22, 2024

tommasini left a comment

NicholasEllul commented Feb 22, 2024

davidmurdoch left a comment

NicholasEllul commented Mar 6, 2024

github-actions bot commented Mar 6, 2024

github-actions bot commented Mar 7, 2024 •

edited by metamaskbot

Loading

sonarcloud bot commented Mar 7, 2024

codecov-commenter commented Mar 7, 2024

NicolasMassart left a comment

NicholasEllul commented Mar 20, 2024

NicolasMassart left a comment

NicholasEllul commented Mar 20, 2024

refactor: Refactor inpage blocklist to avoid usage of regex #8675

refactor: Refactor inpage blocklist to avoid usage of regex #8675

Conversation

NicholasEllul commented Feb 22, 2024 • edited Loading

Description

Related issues

Manual testing steps

Pre-merge author checklist

Pre-merge reviewer checklist

github-actions bot commented Feb 22, 2024

github-actions bot commented Feb 22, 2024

tommasini left a comment

Choose a reason for hiding this comment

NicholasEllul commented Feb 22, 2024

davidmurdoch left a comment

Choose a reason for hiding this comment

NicholasEllul commented Mar 6, 2024

github-actions bot commented Mar 6, 2024

Bitrise

github-actions bot commented Mar 7, 2024 • edited by metamaskbot Loading

Bitrise

sonarcloud bot commented Mar 7, 2024

Quality Gate passed

codecov-commenter commented Mar 7, 2024

Codecov Report

NicolasMassart left a comment

Choose a reason for hiding this comment

NicholasEllul commented Mar 20, 2024

NicolasMassart left a comment

Choose a reason for hiding this comment

NicholasEllul commented Mar 20, 2024

NicholasEllul commented Feb 22, 2024 •

edited

Loading

github-actions bot commented Mar 7, 2024 •

edited by metamaskbot

Loading