Optimized atan2, _softmax, cat, clamp, full, relu, remainder, permute_copy_out ops and updates to use memory_allocator #7567

cad-audio · 2025-01-09T06:50:21Z

Summary

Optimized atan2, _softmax, cat, clamp, full, relu, remainder, permute_copy_out ops and updates to use memory_allocator

Test plan

Unit tested kernels

Adding mean and where ops optimized on HiFi

* adding pow, remainder, minimum, maximum operators * adding pow, remainder, minimum, maximum operators

Adding quantized linear optimized versions for int8 and uint8

* Adding cat, full, permute_copy and relu ops (pytorch#34) * Adding cat, full, permute_copy * updating relu wrt new ref (pytorch#36) * Temporary memory allocation, replacing mallocs (pytorch#38) * Integrated temporary mem alloc functionality in place of malloc * Namespace related changes * Cleanup the main application * Adding atan2, softmax, clamp and remainder ops (pytorch#37) * Replaced malloc with temp_memory_allocator --------- Co-authored-by: nishpoonia <94543206+nishpoonia@users.noreply.github.com> Co-authored-by: Rushi-cad <gherderu@cadence.com>

* adding ET_KERNEL_CHECK for allocate_temp_memory * solving lint error * Removing redundant check

Adding _softmax, relu, permute etc

pytorch-bot · 2025-01-09T06:50:25Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7567

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit fe5e7d7 with merge base 1bac885 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

dijopaul · 2025-01-09T06:52:32Z

@pytorchbot label "topic: not user facing"

- fixing build issue on previous commit

Update functions_hifi.yaml

…d removing exec_ten uses

Incorporating review comments: removing nesting to check data type an…

zonglinpeng

Looks good from eyeballing, will link internally and solve any issues in follow up diffs

zonglinpeng · 2025-01-16T17:19:29Z

examples/portable/executor_runner/executor_runner.cpp

@@ -172,6 +179,7 @@ int main(int argc, char** argv) {

  // Run the model.
  Error status = method->execute();
+


please avoid empty line changes

zonglinpeng · 2025-01-20T18:23:56Z

backends/cadence/hifi/operators/CMakeLists.txt

    "${EXECUTORCH_ROOT}/kernels/portable/cpu/op_clone.cpp"
    "${EXECUTORCH_ROOT}/kernels/portable/cpu/op_embedding.cpp"
-    "${EXECUTORCH_ROOT}/kernels/portable/cpu/op_full.cpp"
+    "${EXECUTORCH_ROOT}/kernels/portable/cpu/op_gt.cpp"


which op requires gt?

dijopaul and others added 17 commits October 23, 2024 06:51

Adding mean and where ops optimized on HiFi

216389c

Merge pull request #14 from dijopaul/main

3d849bb

Adding mean and where ops optimized on HiFi

Adding quantized linear optimized versions for int8 and uint8

9b71aed

adding pow, remainder, minimum, maximum operators (pytorch#33)

07743ab

* adding pow, remainder, minimum, maximum operators * adding pow, remainder, minimum, maximum operators

Fix for build issue faced in div_mod on old tools

edc1b3d

Merge pull request #15 from dijopaul/main

222beee

Adding quantized linear optimized versions for int8 and uint8

Merge branch 'main' into main

6e074ec

Fix build failure due to merge issue

afca3db

Merge branch 'main' into main

10a0ee0

Fixing review comments on PR 6867

f1f0bb3

Cleaning cmakelist to avoid duplications

911021f

Fixing lint issues and removing free statements

18cf518

adding ET_KERNEL_CHECK for allocate_temp_memory (pytorch#41)

5e471f2

* adding ET_KERNEL_CHECK for allocate_temp_memory * solving lint error * Removing redundant check

Merge branch 'main' into main_PR18

6928f95

Fixing lint error due to merge

991961b

Merge pull request #18 from dijopaul/main_PR18

7585ee0

Adding _softmax, relu, permute etc

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 9, 2025

pytorch-bot bot added the topic: not user facing label Jan 9, 2025

dijopaul and others added 2 commits January 9, 2025 13:57

Update functions_hifi.yaml

540243a

- fixing build issue on previous commit

Merge pull request #19 from dijopaul/patch-1

85e7c59

Update functions_hifi.yaml

hsharma35 self-requested a review January 9, 2025 17:07

nishpoonia and others added 3 commits January 10, 2025 12:01

Incorporating review comments: removing nesting to check data type an…

1f681c7

…d removing exec_ten uses

clean up

3539f52

Merge pull request #20 from dijopaul/main_PR18

fe5e7d7

Incorporating review comments: removing nesting to check data type an…

mcr229 requested review from digantdesai and kimishpatel January 14, 2025 00:56

kimishpatel added the module: cadence Issues related to the Cadence/Xtensa backend label Jan 14, 2025

kimishpatel requested review from mcremon-meta and tarun292 January 14, 2025 03:48

zonglinpeng approved these changes Jan 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimized atan2, _softmax, cat, clamp, full, relu, remainder, permute_copy_out ops and updates to use memory_allocator #7567

Optimized atan2, _softmax, cat, clamp, full, relu, remainder, permute_copy_out ops and updates to use memory_allocator #7567

cad-audio commented Jan 9, 2025

pytorch-bot bot commented Jan 9, 2025 •

edited

Loading

dijopaul commented Jan 9, 2025

zonglinpeng left a comment

zonglinpeng Jan 16, 2025

zonglinpeng Jan 20, 2025

		@@ -172,6 +179,7 @@ int main(int argc, char** argv) {

		// Run the model.
		Error status = method->execute();

Optimized atan2, _softmax, cat, clamp, full, relu, remainder, permute_copy_out ops and updates to use memory_allocator #7567

Are you sure you want to change the base?

Optimized atan2, _softmax, cat, clamp, full, relu, remainder, permute_copy_out ops and updates to use memory_allocator #7567

Conversation

cad-audio commented Jan 9, 2025

Summary

Test plan

pytorch-bot bot commented Jan 9, 2025 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7567

✅ No Failures

dijopaul commented Jan 9, 2025

zonglinpeng left a comment

Choose a reason for hiding this comment

zonglinpeng Jan 16, 2025

Choose a reason for hiding this comment

zonglinpeng Jan 20, 2025

Choose a reason for hiding this comment

pytorch-bot bot commented Jan 9, 2025 •

edited

Loading