[Feature,Refactor] Refactor from_dict, add from_any, from_dataclass #1102
Merged
facebook-github-bot added the CLA Signed label Nov 22, 2024
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 34.1030μs | 17.2870μs | 57.8470 KOps/s | 56.3746 KOps/s | |
test_plain_set_stack_nested | 65.0210μs | 17.4660μs | 57.2542 KOps/s | 56.0036 KOps/s | |
test_plain_set_nested_inplace | 55.3240μs | 18.9520μs | 52.7649 KOps/s | 51.5807 KOps/s | |
test_plain_set_stack_nested_inplace | 72.4350μs | 18.8674μs | 53.0016 KOps/s | 52.2718 KOps/s | |
test_items | 21.6310μs | 4.1198μs | 242.7306 KOps/s | 243.4579 KOps/s | |
test_items_nested | 0.4708ms | 0.3333ms | 3.0002 KOps/s | 2.8415 KOps/s | |
test_items_nested_locked | 0.6368ms | 0.3329ms | 3.0037 KOps/s | 2.8708 KOps/s | |
test_items_nested_leaf | 0.1356ms | 71.4812μs | 13.9897 KOps/s | 14.0393 KOps/s | |
test_items_stack_nested | 0.4881ms | 0.3383ms | 2.9557 KOps/s | 2.8419 KOps/s | |
test_items_stack_nested_leaf | 0.1323ms | 75.5267μs | 13.2403 KOps/s | 13.4843 KOps/s | |
test_items_stack_nested_locked | 0.4903ms | 0.3332ms | 3.0015 KOps/s | 2.7671 KOps/s | |
test_keys | 22.6730μs | 3.8586μs | 259.1585 KOps/s | 282.0440 KOps/s | |
test_keys_nested | 0.2286ms | 0.1345ms | 7.4353 KOps/s | 7.3717 KOps/s | |
test_keys_nested_locked | 0.7490ms | 0.1394ms | 7.1740 KOps/s | 7.1005 KOps/s | |
test_keys_nested_leaf | 1.8942ms | 0.1165ms | 8.5854 KOps/s | 8.5536 KOps/s | |
test_keys_stack_nested | 0.2239ms | 0.1343ms | 7.4446 KOps/s | 7.1735 KOps/s | |
test_keys_stack_nested_leaf | 0.1927ms | 0.1142ms | 8.7581 KOps/s | 8.5811 KOps/s | |
test_keys_stack_nested_locked | 0.2301ms | 0.1391ms | 7.1912 KOps/s | 7.0244 KOps/s | |
test_values | 6.7000μs | 1.0260μs | 974.6449 KOps/s | 963.5079 KOps/s | |
test_values_nested | 0.1001ms | 54.5122μs | 18.3445 KOps/s | 18.0208 KOps/s | |
test_values_nested_locked | 0.1201ms | 55.2840μs | 18.0884 KOps/s | 18.1233 KOps/s | |
test_values_nested_leaf | 0.1062ms | 59.8788μs | 16.7004 KOps/s | 16.6361 KOps/s | |
test_values_stack_nested | 0.1053ms | 56.7540μs | 17.6199 KOps/s | 16.8062 KOps/s | |
test_values_stack_nested_leaf | 0.1097ms | 59.0373μs | 16.9385 KOps/s | 16.3261 KOps/s | |
test_values_stack_nested_locked | 0.1089ms | 57.0036μs | 17.5427 KOps/s | 17.6031 KOps/s | |
test_membership | 6.5280μs | 0.7469μs | 1.3388 MOps/s | 1.1018 MOps/s | |
test_membership_nested | 43.1400μs | 2.8189μs | 354.7493 KOps/s | 365.0361 KOps/s | |
test_membership_nested_leaf | 23.5540μs | 2.8578μs | 349.9173 KOps/s | 357.4106 KOps/s | |
test_membership_stacked_nested | 43.9210μs | 2.7756μs | 360.2785 KOps/s | 359.4974 KOps/s | |
test_membership_stacked_nested_leaf | 23.0830μs | 2.8034μs | 356.7145 KOps/s | 361.4756 KOps/s | |
test_membership_nested_last | 46.9570μs | 4.0779μs | 245.2232 KOps/s | 246.9645 KOps/s | |
test_membership_nested_leaf_last | 35.4450μs | 4.0621μs | 246.1788 KOps/s | 245.3356 KOps/s | |
test_membership_stacked_nested_last | 0.1060ms | 7.5913μs | 131.7301 KOps/s | 195.0668 KOps/s | |
test_membership_stacked_nested_leaf_last | 75.7500μs | 7.5953μs | 131.6600 KOps/s | 193.5114 KOps/s | |
test_nested_getleaf | 50.7450μs | 10.4127μs | 96.0366 KOps/s | 94.7349 KOps/s | |
test_nested_get | 51.1850μs | 9.9014μs | 100.9954 KOps/s | 98.9429 KOps/s | |
test_stacked_getleaf | 35.3960μs | 10.4981μs | 95.2551 KOps/s | 94.9434 KOps/s | |
test_stacked_get | 50.6590μs | 9.7251μs | 102.8271 KOps/s | 99.0751 KOps/s | |
test_nested_getitemleaf | 36.5880μs | 10.9133μs | 91.6311 KOps/s | 90.2462 KOps/s | |
test_nested_getitem | 54.7520μs | 10.2976μs | 97.1103 KOps/s | 96.7406 KOps/s | |
test_stacked_getitemleaf | 39.8240μs | 10.7964μs | 92.6232 KOps/s | 90.2104 KOps/s | |
test_stacked_getitem | 48.9610μs | 10.0986μs | 99.0236 KOps/s | 97.9131 KOps/s | |
test_lock_nested | 1.9152ms | 0.4342ms | 2.3031 KOps/s | 1.8559 KOps/s | |
test_lock_stack_nested | 0.6106ms | 0.4028ms | 2.4829 KOps/s | 2.4118 KOps/s | |
test_unlock_nested | 0.6661ms | 0.3530ms | 2.8328 KOps/s | 2.7649 KOps/s | |
test_unlock_stack_nested | 0.4987ms | 0.3203ms | 3.1218 KOps/s | 3.0073 KOps/s | |
test_flatten_speed | 0.2071ms | 93.1737μs | 10.7326 KOps/s | 10.6823 KOps/s | |
test_unflatten_speed | 0.8802ms | 0.4786ms | 2.0893 KOps/s | 2.0879 KOps/s | |
test_common_ops | 6.0371ms | 0.7966ms | 1.2553 KOps/s | 1.3015 KOps/s | |
test_creation | 17.0620μs | 2.0621μs | 484.9500 KOps/s | 480.0849 KOps/s | |
test_creation_empty | 36.3880μs | 10.3491μs | 96.6264 KOps/s | 92.9795 KOps/s | |
test_creation_nested_1 | 42.6490μs | 13.0626μs | 76.5543 KOps/s | 72.8668 KOps/s | |
test_creation_nested_2 | 48.3000μs | 17.6919μs | 56.5232 KOps/s | 56.8051 KOps/s | |
test_clone | 45.2840μs | 13.1196μs | 76.2218 KOps/s | 72.8570 KOps/s | |
test_getitem[int] | 0.9921ms | 12.3675μs | 80.8573 KOps/s | 76.5172 KOps/s | |
test_getitem[slice_int] | 0.1361ms | 23.2593μs | 42.9936 KOps/s | 38.9394 KOps/s | |
test_getitem[range] | 0.1744ms | 48.4228μs | 20.6514 KOps/s | 20.5514 KOps/s | |
test_getitem[tuple] | 0.1383ms | 19.4972μs | 51.2895 KOps/s | 48.4659 KOps/s | |
test_getitem[list] | 0.1961ms | 43.6847μs | 22.8913 KOps/s | 22.5987 KOps/s | |
test_setitem_dim[int] | 57.7580μs | 24.7905μs | 40.3380 KOps/s | 39.2229 KOps/s | |
test_setitem_dim[slice_int] | 86.1200μs | 50.1646μs | 19.9344 KOps/s | 19.4117 KOps/s | |
test_setitem_dim[range] | 0.1170ms | 72.6845μs | 13.7581 KOps/s | 13.5223 KOps/s | |
test_setitem_dim[tuple] | 74.9800μs | 40.4840μs | 24.7011 KOps/s | 24.4349 KOps/s | |
test_setitem | 56.7160μs | 19.6712μs | 50.8356 KOps/s | 47.7425 KOps/s | |
test_set | 0.1642ms | 18.9497μs | 52.7712 KOps/s | 48.2414 KOps/s | |
test_set_shared | 1.3216ms | 0.1691ms | 5.9151 KOps/s | 5.8857 KOps/s | |
test_update | 0.2022ms | 20.8645μs | 47.9282 KOps/s | 43.6360 KOps/s | |
test_update_nested | 0.2165ms | 31.3960μs | 31.8511 KOps/s | 30.2551 KOps/s | |
test_update__nested | 0.6793ms | 32.3547μs | 30.9074 KOps/s | 29.7656 KOps/s | |
test_set_nested | 0.1417ms | 20.8131μs | 48.0466 KOps/s | 44.8777 KOps/s | |
test_set_nested_new | 0.2446ms | 25.5363μs | 39.1600 KOps/s | 37.6342 KOps/s | |
test_select | 0.2688ms | 41.1061μs | 24.3273 KOps/s | 23.1139 KOps/s | |
test_select_nested | 0.1276ms | 59.6169μs | 16.7738 KOps/s | 16.4294 KOps/s | |
test_exclude_nested | 0.1358ms | 73.8328μs | 13.5441 KOps/s | 13.2114 KOps/s | |
test_empty[True] | 0.5639ms | 0.3470ms | 2.8820 KOps/s | 2.8365 KOps/s | |
test_empty[False] | 10.6897μs | 1.1889μs | 841.0938 KOps/s | 802.0429 KOps/s | |
test_unbind_speed | 0.4586ms | 0.2568ms | 3.8939 KOps/s | 3.7382 KOps/s | |
test_unbind_speed_stack0 | 0.4941ms | 0.2518ms | 3.9714 KOps/s | 3.8478 KOps/s | |
test_unbind_speed_stack1 | 0.1020s | 0.7463ms | 1.3400 KOps/s | 1.4110 KOps/s | |
test_split | 2.4748ms | 1.5662ms | 638.5084 Ops/s | 565.7705 Ops/s | |
test_chunk | 0.1020s | 1.8673ms | 535.5227 Ops/s | 566.1277 Ops/s | |
test_consolidate_njt[False-None] | 9.6366ms | 8.1948ms | 122.0289 Ops/s | 120.3956 Ops/s | |
test_creation[device0] | 3.9614ms | 92.2221μs | 10.8434 KOps/s | 10.7099 KOps/s | |
test_creation_from_tensor | 0.2554ms | 94.9987μs | 10.5265 KOps/s | 10.3980 KOps/s | |
test_add_one[memmap_tensor0] | 0.2270ms | 4.6118μs | 216.8329 KOps/s | 194.9969 KOps/s | |
test_contiguous[memmap_tensor0] | 8.7260μs | 0.5118μs | 1.9541 MOps/s | 1.9214 MOps/s | |
test_stack[memmap_tensor0] | 55.3730μs | 3.3043μs | 302.6331 KOps/s | 284.3044 KOps/s | |
test_memmaptd_index | 0.4455ms | 0.2277ms | 4.3915 KOps/s | 4.0766 KOps/s | |
test_memmaptd_index_astensor | 0.5367ms | 0.3045ms | 3.2839 KOps/s | 3.0965 KOps/s | |
test_memmaptd_index_op | 1.4144ms | 0.5639ms | 1.7732 KOps/s | 1.6572 KOps/s | |
test_serialize_model | 0.1203s | 0.1177s | 8.4928 Ops/s | 7.6414 Ops/s | |
test_serialize_model_pickle | 0.4954s | 0.3914s | 2.5546 Ops/s | 2.5950 Ops/s | |
test_serialize_weights | 0.2182s | 0.1301s | 7.6863 Ops/s | 8.9022 Ops/s | |
test_serialize_weights_returnearly | 0.1717s | 0.1588s | 6.2973 Ops/s | 6.3515 Ops/s | |
test_serialize_weights_pickle | 1.1507s | 0.7034s | 1.4217 Ops/s | 2.5563 Ops/s | |
test_serialize_weights_filesystem | 0.1470s | 0.1411s | 7.0872 Ops/s | 6.4512 Ops/s | |
test_serialize_model_filesystem | 0.2491s | 0.1533s | 6.5214 Ops/s | 6.7168 Ops/s | |
test_reshape_pytree | 60.5230μs | 26.6092μs | 37.5810 KOps/s | 36.5519 KOps/s | |
test_reshape_td | 87.9540μs | 31.9148μs | 31.3334 KOps/s | 29.2872 KOps/s | |
test_view_pytree | 57.3060μs | 26.4279μs | 37.8387 KOps/s | 37.2346 KOps/s | |
test_view_td | 77.3240μs | 36.9628μs | 27.0542 KOps/s | 24.9384 KOps/s | |
test_unbind_pytree | 79.5680μs | 29.5646μs | 33.8243 KOps/s | 33.0724 KOps/s | |
test_unbind_td | 0.3192ms | 37.7963μs | 26.4576 KOps/s | 25.4117 KOps/s | |
test_split_pytree | 73.3970μs | 29.2527μs | 34.1849 KOps/s | 33.0969 KOps/s | |
test_split_td | 0.5412ms | 43.3068μs | 23.0911 KOps/s | 21.1785 KOps/s | |
test_add_pytree | 75.4010μs | 34.9125μs | 28.6431 KOps/s | 27.0223 KOps/s | |
test_add_td | 0.1317ms | 55.0160μs | 18.1765 KOps/s | 17.2024 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1557ms | 61.7519μs | 16.1938 KOps/s | 16.3039 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 1.4120ms | 0.1580ms | 6.3281 KOps/s | 6.2149 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1082ms | 44.7869μs | 22.3279 KOps/s | 21.6575 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2241ms | 0.1174ms | 8.5206 KOps/s | 8.2820 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 73.0360μs | 25.4129μs | 39.3500 KOps/s | 38.5797 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1225ms | 54.8542μs | 18.2301 KOps/s | 18.6301 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1498ms | 79.2385μs | 12.6201 KOps/s | 12.5375 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1538ms | 68.1158μs | 14.6809 KOps/s | 14.6091 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2018ms | 0.1038ms | 9.6378 KOps/s | 9.3719 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3213ms | 0.1936ms | 5.1648 KOps/s | 4.9857 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 88.7450μs | 44.1390μs | 22.6557 KOps/s | 22.3760 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4773ms | 60.8717μs | 16.4280 KOps/s | 16.3307 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1870ms | 0.1017ms | 9.8339 KOps/s | 9.6050 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3715ms | 0.2018ms | 4.9559 KOps/s | 4.8608 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3409ms | 0.2046ms | 4.8865 KOps/s | 4.7165 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2032ms | 0.1055ms | 9.4763 KOps/s | 9.3426 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2444ms | 57.0981μs | 17.5137 KOps/s | 18.3048 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1232ms | 46.2514μs | 21.6210 KOps/s | 20.9764 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 1.6657ms | 0.1605ms | 6.2288 KOps/s | 6.2372 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1837ms | 0.1017ms | 9.8299 KOps/s | 9.5112 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 59.0600μs | 20.9775μs | 47.6702 KOps/s | 46.9939 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1342ms | 57.7505μs | 17.3159 KOps/s | 16.7974 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1754ms | 82.6690μs | 12.0964 KOps/s | 12.1435 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1361ms | 68.3195μs | 14.6371 KOps/s | 14.3555 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.7159ms | 0.2051ms | 4.8755 KOps/s | 4.6262 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.4894ms | 1.2318ms | 811.7940 Ops/s | 771.6898 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3275ms | 0.1987ms | 5.0330 KOps/s | 4.7738 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.5917ms | 0.7686ms | 1.3011 KOps/s | 1.2668 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5306ms | 0.4500ms | 2.2224 KOps/s | 2.1390 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.6093ms | 2.4827ms | 402.7908 Ops/s | 384.7244 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 83.9460μs | 35.3448μs | 28.2927 KOps/s | 27.9305 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5600ms | 31.6103μs | 31.6353 KOps/s | 29.4471 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 80.3900μs | 28.3866μs | 35.2279 KOps/s | 34.2757 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 62.2760μs | 22.7733μs | 43.9111 KOps/s | 43.2975 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 72.0240μs | 29.0890μs | 34.3773 KOps/s | 33.4281 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 62.2260μs | 22.7869μs | 43.8849 KOps/s | 43.1447 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1083ms | 51.1309μs | 19.5576 KOps/s | 19.0059 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.3474ms | 18.7119μs | 53.4419 KOps/s | 48.1986 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1494ms | 44.3547μs | 22.5455 KOps/s | 22.0423 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 59.4100μs | 18.6061μs | 53.7458 KOps/s | 52.1190 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1092ms | 45.0308μs | 22.2070 KOps/s | 21.6272 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 94.1740μs | 18.7034μs | 53.4663 KOps/s | 52.3221 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1206ms | 51.5705μs | 19.3909 KOps/s | 18.5757 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9293ms | 19.1896μs | 52.1117 KOps/s | 47.7547 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1071ms | 44.4743μs | 22.4849 KOps/s | 21.5909 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 70.6110μs | 18.6178μs | 53.7120 KOps/s | 51.5279 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1022ms | 43.8393μs | 22.8106 KOps/s | 21.4974 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.4314ms | 18.5020μs | 54.0481 KOps/s | 52.6353 KOps/s | |
test_mod_add[eager] | 70.7420μs | 25.1104μs | 39.8241 KOps/s | 37.1479 KOps/s | |
test_mod_add[compile] | 89.2470μs | 43.5611μs | 22.9562 KOps/s | 22.0727 KOps/s | |
test_mod_add[compile-overhead] | 93.7950μs | 42.9986μs | 23.2565 KOps/s | 21.2648 KOps/s | |
test_mod_wrap[eager] | 0.3077ms | 0.2090ms | 4.7838 KOps/s | 4.6559 KOps/s | |
test_mod_wrap[compile] | 1.8893ms | 0.2004ms | 4.9911 KOps/s | 4.7532 KOps/s | |
test_mod_wrap[compile-overhead] | 1.7310ms | 0.1971ms | 5.0746 KOps/s | 4.6126 KOps/s | |
test_mod_wrap_and_backward[eager] | 14.0737ms | 11.0926ms | 90.1503 Ops/s | 93.3450 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.7313ms | 12.4913ms | 80.0554 Ops/s | 87.2356 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 13.8482ms | 11.8640ms | 84.2883 Ops/s | 93.8170 Ops/s | |
test_seq_add[eager] | 0.2176ms | 90.2009μs | 11.0864 KOps/s | 10.7605 KOps/s | |
test_seq_add[compile] | 0.1128ms | 57.6025μs | 17.3604 KOps/s | 16.5197 KOps/s | |
test_seq_add[compile-overhead] | 0.1413ms | 57.1612μs | 17.4944 KOps/s | 16.5699 KOps/s | |
test_seq_wrap[eager] | 0.5687ms | 0.3848ms | 2.5985 KOps/s | 2.5110 KOps/s | |
test_seq_wrap[compile] | 0.3591ms | 0.2215ms | 4.5142 KOps/s | 4.2936 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4073ms | 0.2219ms | 4.5073 KOps/s | 4.3496 KOps/s | |
test_func_call_runtime[False-eager] | 0.7679ms | 0.5324ms | 1.8784 KOps/s | 1.8023 KOps/s | |
test_func_call_runtime[False-compile] | 0.5082ms | 0.4166ms | 2.4002 KOps/s | 2.2897 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.9115ms | 0.4190ms | 2.3864 KOps/s | 2.2869 KOps/s | |
test_func_call_runtime[True-eager] | 1.1993ms | 0.7445ms | 1.3432 KOps/s | 1.2912 KOps/s | |
test_func_call_runtime[True-compile] | 1.0872ms | 0.4601ms | 2.1737 KOps/s | 2.1088 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5948ms | 0.4522ms | 2.2112 KOps/s | 2.0806 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9855ms | 0.5276ms | 1.8953 KOps/s | 1.8013 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5194ms | 0.4149ms | 2.4105 KOps/s | 2.2642 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5536ms | 0.4142ms | 2.4145 KOps/s | 2.2759 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.3418ms | 0.8693ms | 1.1504 KOps/s | 1.1087 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6017ms | 0.4782ms | 2.0912 KOps/s | 1.9835 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5790ms | 0.4795ms | 2.0855 KOps/s | 1.9910 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6558ms | 1.8684ms | 535.2033 Ops/s | 516.8055 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0098ms | 0.5075ms | 1.9703 KOps/s | 1.8930 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.9966ms | 0.5083ms | 1.9675 KOps/s | 1.8880 KOps/s | |
test_distributed | 0.2546ms | 0.1258ms | 7.9504 KOps/s | 7.8288 KOps/s | |
test_tdmodule | 33.9740μs | 18.3381μs | 54.5312 KOps/s | 53.9625 KOps/s | |
test_tdmodule_dispatch | 82.0930μs | 39.4660μs | 25.3382 KOps/s | 27.5962 KOps/s | |
test_tdseq | 42.3890μs | 21.0052μs | 47.6073 KOps/s | 47.2447 KOps/s | |
test_tdseq_dispatch | 78.8970μs | 43.2497μs | 23.1215 KOps/s | 20.6295 KOps/s | |
test_instantiation_functorch | 1.7781ms | 1.4981ms | 667.5203 Ops/s | 647.6180 Ops/s | |
test_exec_functorch | 0.3968ms | 0.1791ms | 5.5822 KOps/s | 5.3712 KOps/s | |
test_exec_functional_call | 0.4471ms | 0.1727ms | 5.7906 KOps/s | 5.6065 KOps/s | |
test_exec_td_decorator | 0.4808ms | 0.2241ms | 4.4622 KOps/s | 4.2987 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9224ms | 0.6383ms | 1.5666 KOps/s | 1.5432 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9513ms | 0.6377ms | 1.5681 KOps/s | 1.5391 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7763ms | 0.5262ms | 1.9006 KOps/s | 1.8658 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8411ms | 0.5229ms | 1.9124 KOps/s | 1.8672 KOps/s | |
test_to_module_speed[True] | 2.0473ms | 1.3008ms | 768.7777 Ops/s | 784.4814 Ops/s | |
test_to_module_speed[False] | 1.9752ms | 1.2552ms | 796.6598 Ops/s | 796.4047 Ops/s | |
test_tc_init | 81.1920μs | 45.6105μs | 21.9248 KOps/s | 21.8243 KOps/s | |
test_tc_init_nested | 0.1603ms | 89.9266μs | 11.1202 KOps/s | 10.7845 KOps/s | |
test_tc_first_layer_tensor | 17.7030μs | 1.5086μs | 662.8518 KOps/s | 665.6418 KOps/s | |
test_tc_first_layer_nontensor | 44.4630μs | 4.6173μs | 216.5784 KOps/s | 208.1628 KOps/s | |
test_tc_second_layer_tensor | 20.7280μs | 2.7547μs | 363.0162 KOps/s | 356.6495 KOps/s | |
test_tc_second_layer_nontensor | 46.9370μs | 5.9589μs | 167.8157 KOps/s | 162.0685 KOps/s | |
test_unbind | 0.2155s | 11.2444ms | 88.9331 Ops/s | 84.4262 Ops/s | |
test_full_like | 17.8120ms | 11.7449ms | 85.1431 Ops/s | 88.4085 Ops/s | |
test_zeros_like | 14.8970ms | 7.4934ms | 133.4510 Ops/s | 136.3769 Ops/s | |
test_ones_like | 9.6305ms | 7.7323ms | 129.3274 Ops/s | 133.8260 Ops/s | |
test_clone | 12.4458ms | 9.1106ms | 109.7623 Ops/s | 108.5650 Ops/s | |
test_squeeze | 56.6780μs | 11.8531μs | 84.3664 KOps/s | 83.2392 KOps/s | |
test_unsqueeze | 0.1786ms | 89.1765μs | 11.2137 KOps/s | 11.2476 KOps/s | |
test_split | 0.5000ms | 0.1870ms | 5.3465 KOps/s | 5.1563 KOps/s | |
test_permute | 0.7479ms | 0.2235ms | 4.4748 KOps/s | 4.5954 KOps/s | |
test_stack | 27.2679ms | 23.5010ms | 42.5514 Ops/s | 40.6041 Ops/s | |
test_cat | 25.8057ms | 23.4635ms | 42.6193 Ops/s | 40.7704 Ops/s |
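
The Change column is empty in this copy of the table. As a rough aid for reading it, the sketch below recomputes a relative change from the two throughput columns. It assumes that "Ops" is the run on this PR, that "Ops on Repo HEAD" is the baseline, and that the change is a percentage of the baseline; none of this is confirmed by the page, and the helper names `parse_ops` and `relative_change` are hypothetical.

```python
import re
from typing import Tuple

# Multipliers for the throughput units that appear in the table.
_UNITS = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '57.8470 KOps/s' to operations per second."""
    match = re.fullmatch(r"\s*([0-9.]+)\s*([KM]?Ops/s)\s*", cell)
    if match is None:
        raise ValueError(f"unrecognised throughput cell: {cell!r}")
    value, unit = match.groups()
    return float(value) * _UNITS[unit]


def relative_change(row: str) -> Tuple[str, float]:
    """Return (test name, percent change of 'Ops' relative to 'Ops on Repo HEAD').

    Assumption: column 4 is the PR run and column 5 is the baseline.
    """
    cells = [c.strip() for c in row.split("|")]
    name = cells[0]
    ops_pr = parse_ops(cells[3])
    ops_head = parse_ops(cells[4])
    return name, 100.0 * (ops_pr - ops_head) / ops_head


# Three rows copied verbatim from the table above.
rows = [
    "test_membership | 6.5280μs | 0.7469μs | 1.3388 MOps/s | 1.1018 MOps/s | |",
    "test_membership_stacked_nested_last | 0.1060ms | 7.5913μs | 131.7301 KOps/s | 195.0668 KOps/s | |",
    "test_split | 2.4748ms | 1.5662ms | 638.5084 Ops/s | 565.7705 Ops/s | |",
]

for row in rows:
    name, pct = relative_change(row)
    print(f"{name}: {pct:+.1f}%")
```

Normalising the units first matters because a single table mixes Ops/s, KOps/s and MOps/s. Single-run differences of a few percent are likely within noise for micro-benchmarks like these, so only the larger gaps (for example the two `test_membership_stacked_nested*_last` rows) are worth a closer look.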

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 34.6300μs | 10.8122μs | 92.4877 KOps/s | 92.5624 KOps/s | |
test_plain_set_stack_nested | 34.8400μs | 10.8999μs | 91.7439 KOps/s | 91.9987 KOps/s | |
test_plain_set_nested_inplace | 41.9300μs | 11.7965μs | 84.7708 KOps/s | 85.1144 KOps/s | |
test_plain_set_stack_nested_inplace | 35.9410μs | 11.7547μs | 85.0724 KOps/s | 85.3195 KOps/s | |
test_items | 23.5910μs | 2.9018μs | 344.6192 KOps/s | 341.0243 KOps/s | |
test_items_nested | 0.3740ms | 0.3200ms | 3.1251 KOps/s | 3.0954 KOps/s | |
test_items_nested_locked | 0.3669ms | 0.3212ms | 3.1136 KOps/s | 3.0740 KOps/s | |
test_items_nested_leaf | 94.4110μs | 59.1904μs | 16.8946 KOps/s | 16.8243 KOps/s | |
test_items_stack_nested | 0.3564ms | 0.3192ms | 3.1325 KOps/s | 3.1124 KOps/s | |
test_items_stack_nested_leaf | 88.4410μs | 60.0535μs | 16.6518 KOps/s | 16.8351 KOps/s | |
test_items_stack_nested_locked | 0.3505ms | 0.3217ms | 3.1090 KOps/s | 3.0907 KOps/s | |
test_keys | 27.1310μs | 3.4818μs | 287.2097 KOps/s | 288.9295 KOps/s | |
test_keys_nested | 0.1030ms | 71.7735μs | 13.9327 KOps/s | 13.9113 KOps/s | |
test_keys_nested_locked | 0.8214ms | 76.8135μs | 13.0185 KOps/s | 12.8403 KOps/s | |
test_keys_nested_leaf | 0.1022ms | 63.1604μs | 15.8327 KOps/s | 15.7897 KOps/s | |
test_keys_stack_nested | 0.1079ms | 72.3021μs | 13.8309 KOps/s | 13.9348 KOps/s | |
test_keys_stack_nested_leaf | 0.1006ms | 63.3790μs | 15.7781 KOps/s | 15.8189 KOps/s | |
test_keys_stack_nested_locked | 0.1132ms | 77.5082μs | 12.9019 KOps/s | 12.9138 KOps/s | |
test_values | 5.5502μs | 0.8458μs | 1.1823 MOps/s | 1.1826 MOps/s | |
test_values_nested | 59.4400μs | 31.6212μs | 31.6243 KOps/s | 31.3617 KOps/s | |
test_values_nested_locked | 54.8500μs | 32.7165μs | 30.5656 KOps/s | 30.5257 KOps/s | |
test_values_nested_leaf | 61.9510μs | 34.1449μs | 29.2869 KOps/s | 29.1836 KOps/s | |
test_values_stack_nested | 59.9310μs | 32.2501μs | 31.0077 KOps/s | 31.1229 KOps/s | |
test_values_stack_nested_leaf | 57.1410μs | 34.6093μs | 28.8940 KOps/s | 29.1327 KOps/s | |
test_values_stack_nested_locked | 60.5210μs | 32.9488μs | 30.3501 KOps/s | 30.2173 KOps/s | |
test_membership | 2.2190μs | 0.5307μs | 1.8842 MOps/s | 1.8799 MOps/s | |
test_membership_nested | 22.6905μs | 1.9072μs | 524.3300 KOps/s | 521.9835 KOps/s | |
test_membership_nested_leaf | 15.4250μs | 1.9292μs | 518.3443 KOps/s | 519.5267 KOps/s | |
test_membership_stacked_nested | 24.7510μs | 1.9546μs | 511.6089 KOps/s | 496.9881 KOps/s | |
test_membership_stacked_nested_leaf | 34.5810μs | 1.9503μs | 512.7419 KOps/s | 499.8167 KOps/s | |
test_membership_nested_last | 26.8500μs | 2.8612μs | 349.5076 KOps/s | 348.1006 KOps/s | |
test_membership_nested_leaf_last | 31.8510μs | 2.9065μs | 344.0592 KOps/s | 349.9412 KOps/s | |
test_membership_stacked_nested_last | 33.4810μs | 5.8603μs | 170.6384 KOps/s | 343.8574 KOps/s | |
test_membership_stacked_nested_leaf_last | 26.4410μs | 5.8145μs | 171.9824 KOps/s | 348.3986 KOps/s | |
test_nested_getleaf | 36.0910μs | 6.3228μs | 158.1590 KOps/s | 157.3946 KOps/s | |
test_nested_get | 27.0410μs | 6.0503μs | 165.2799 KOps/s | 164.1901 KOps/s | |
test_stacked_getleaf | 32.0410μs | 6.4036μs | 156.1626 KOps/s | 157.0537 KOps/s | |
test_stacked_get | 31.0300μs | 6.0323μs | 165.7730 KOps/s | 165.7925 KOps/s | |
test_nested_getitemleaf | 26.6410μs | 6.5056μs | 153.7141 KOps/s | 154.8499 KOps/s | |
test_nested_getitem | 36.2510μs | 6.1113μs | 163.6325 KOps/s | 162.7739 KOps/s | |
test_stacked_getitemleaf | 31.7810μs | 6.4678μs | 154.6124 KOps/s | 153.1855 KOps/s | |
test_stacked_getitem | 26.9810μs | 6.1527μs | 162.5312 KOps/s | 163.2015 KOps/s | |
test_lock_nested | 0.8147ms | 0.3728ms | 2.6826 KOps/s | 2.6183 KOps/s | |
test_lock_stack_nested | 0.4229ms | 0.3438ms | 2.9086 KOps/s | 2.8971 KOps/s | |
test_unlock_nested | 0.6296ms | 0.3074ms | 3.2527 KOps/s | 3.2436 KOps/s | |
test_unlock_stack_nested | 0.3093ms | 0.2746ms | 3.6420 KOps/s | 3.6024 KOps/s | |
test_flatten_speed | 0.1205ms | 74.0250μs | 13.5090 KOps/s | 13.5285 KOps/s | |
test_unflatten_speed | 0.3306ms | 0.2982ms | 3.3533 KOps/s | 3.3020 KOps/s | |
test_common_ops | 1.5759ms | 0.5823ms | 1.7173 KOps/s | 1.7002 KOps/s | |
test_creation | 0.1878ms | 1.5047μs | 664.6024 KOps/s | 673.3490 KOps/s | |
test_creation_empty | 33.0310μs | 7.4053μs | 135.0381 KOps/s | 137.4276 KOps/s | |
test_creation_nested_1 | 41.2810μs | 8.9882μs | 111.2573 KOps/s | 111.7356 KOps/s | |
test_creation_nested_2 | 44.0100μs | 11.3955μs | 87.7542 KOps/s | 88.2602 KOps/s | |
test_clone | 0.1335ms | 10.3615μs | 96.5113 KOps/s | 90.7138 KOps/s | |
test_getitem[int] | 1.6779ms | 10.4611μs | 95.5920 KOps/s | 90.7461 KOps/s | |
test_getitem[slice_int] | 0.1172ms | 20.3675μs | 49.0978 KOps/s | 46.5288 KOps/s | |
test_getitem[range] | 0.1492ms | 38.2396μs | 26.1509 KOps/s | 25.5049 KOps/s | |
test_getitem[tuple] | 0.1141ms | 17.9595μs | 55.6808 KOps/s | 54.0315 KOps/s | |
test_getitem[list] | 0.1432ms | 33.5663μs | 29.7918 KOps/s | 29.3804 KOps/s | |
test_setitem_dim[int] | 53.8610μs | 18.4730μs | 54.1331 KOps/s | 50.3596 KOps/s | |
test_setitem_dim[slice_int] | 58.5910μs | 37.6769μs | 26.5415 KOps/s | 25.8187 KOps/s | |
test_setitem_dim[range] | 87.8210μs | 53.9517μs | 18.5351 KOps/s | 18.6205 KOps/s | |
test_setitem_dim[tuple] | 57.4210μs | 31.6267μs | 31.6188 KOps/s | 30.6865 KOps/s | |
test_setitem | 0.1349ms | 14.4711μs | 69.1033 KOps/s | 65.1007 KOps/s | |
test_set | 0.1550ms | 14.1215μs | 70.8141 KOps/s | 67.4960 KOps/s | |
test_set_shared | 1.6281ms | 0.1479ms | 6.7613 KOps/s | 6.7773 KOps/s | |
test_update | 0.4457ms | 16.5647μs | 60.3694 KOps/s | 57.6184 KOps/s | |
test_update_nested | 0.1280ms | 21.4207μs | 46.6838 KOps/s | 45.7586 KOps/s | |
test_update__nested | 1.2636ms | 24.8657μs | 40.2161 KOps/s | 39.6401 KOps/s | |
test_set_nested | 0.1302ms | 15.3715μs | 65.0553 KOps/s | 62.5005 KOps/s | |
test_set_nested_new | 0.1249ms | 17.6343μs | 56.7078 KOps/s | 53.7903 KOps/s | |
test_select | 0.1331ms | 29.0222μs | 34.4564 KOps/s | 33.2391 KOps/s | |
test_select_nested | 70.6510μs | 42.7005μs | 23.4189 KOps/s | 22.9885 KOps/s | |
test_exclude_nested | 88.5110μs | 59.7424μs | 16.7385 KOps/s | 16.7147 KOps/s | |
test_empty[True] | 0.2959ms | 0.2591ms | 3.8600 KOps/s | 3.8179 KOps/s | |
test_empty[False] | 3.1651μs | 0.7443μs | 1.3435 MOps/s | 1.3617 MOps/s | |
test_to | 84.6310μs | 56.3023μs | 17.7613 KOps/s | 18.0894 KOps/s | |
test_to_nonblocking | 1.0336ms | 46.0479μs | 21.7165 KOps/s | 21.3551 KOps/s | |
test_unbind_speed | 0.2870ms | 0.2329ms | 4.2944 KOps/s | 4.2045 KOps/s | |
test_unbind_speed_stack0 | 0.2837ms | 0.2293ms | 4.3615 KOps/s | 4.2537 KOps/s | |
test_unbind_speed_stack1 | 95.8547ms | 0.6507ms | 1.5369 KOps/s | 1.5115 KOps/s | |
test_split | 97.8094ms | 1.5711ms | 636.4985 Ops/s | 613.3760 Ops/s | |
test_chunk | 99.9002ms | 1.7080ms | 585.4820 Ops/s | 561.5600 Ops/s | |
test_consolidate[False-None] | 3.1838ms | 2.6416ms | 378.5603 Ops/s | 385.0178 Ops/s | |
test_consolidate[default-None] | 1.7295ms | 1.6772ms | 596.2204 Ops/s | 592.9537 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.7909ms | 1.7132ms | 583.6981 Ops/s | 580.6406 Ops/s | |
test_consolidate_njt[False-None] | 6.9814ms | 6.6824ms | 149.6476 Ops/s | 148.7582 Ops/s | |
test_to[False-False-None] | 1.7947ms | 1.6760ms | 596.6628 Ops/s | 607.8715 Ops/s | |
test_to[True-False-None] | 1.4977ms | 1.2654ms | 790.2646 Ops/s | 758.3347 Ops/s | |
test_to[within-False-None] | 4.1305ms | 4.0101ms | 249.3676 Ops/s | 179.2276 Ops/s | |
test_to[True-default-None] | 5.4057ms | 5.1420ms | 194.4758 Ops/s | 183.7736 Ops/s | |
test_to_njt[False-False-None] | 7.1189ms | 7.0431ms | 141.9834 Ops/s | 137.0739 Ops/s | |
test_to_njt[True-False-None] | 6.0424ms | 5.5869ms | 178.9917 Ops/s | 173.2458 Ops/s | |
test_to_njt[within-False-None] | 12.4466ms | 12.3602ms | 80.9051 Ops/s | 77.2759 Ops/s | |
test_creation[device0] | 0.5366ms | 79.8593μs | 12.5220 KOps/s | 11.8364 KOps/s | |
test_creation_from_tensor | 0.5243ms | 83.0774μs | 12.0370 KOps/s | 11.5125 KOps/s | |
test_add_one[memmap_tensor0] | 0.3906ms | 6.7304μs | 148.5797 KOps/s | 138.1429 KOps/s | |
test_contiguous[memmap_tensor0] | 2.4481μs | 0.4297μs | 2.3271 MOps/s | 2.4378 MOps/s | |
test_stack[memmap_tensor0] | 38.6500μs | 4.3631μs | 229.1966 KOps/s | 214.8246 KOps/s | |
test_memmaptd_index | 1.7622ms | 0.2496ms | 4.0061 KOps/s | 3.9161 KOps/s | |
test_memmaptd_index_astensor | 0.6016ms | 0.3104ms | 3.2217 KOps/s | 3.2011 KOps/s | |
test_memmaptd_index_op | 0.3049s | 0.7517ms | 1.3304 KOps/s | 1.6556 KOps/s | |
test_serialize_model | 0.1322s | 0.1315s | 7.6049 Ops/s | 7.6315 Ops/s | |
test_serialize_model_pickle | 1.3779s | 1.1921s | 0.8388 Ops/s | 0.8253 Ops/s | |
test_serialize_weights | 0.1308s | 0.1297s | 7.7074 Ops/s | 7.7061 Ops/s | |
test_serialize_weights_returnearly | 0.3096s | 53.3619ms | 18.7400 Ops/s | 15.0601 Ops/s | |
test_serialize_weights_pickle | 1.3763s | 1.1906s | 0.8399 Ops/s | 0.8439 Ops/s | |
test_reshape_pytree | 55.9710μs | 23.0459μs | 43.3916 KOps/s | 43.5660 KOps/s | |
test_reshape_td | 57.5110μs | 27.2235μs | 36.7330 KOps/s | 36.5968 KOps/s | |
test_view_pytree | 56.7100μs | 23.1538μs | 43.1895 KOps/s | 43.7558 KOps/s | |
test_view_td | 65.1610μs | 30.4041μs | 32.8903 KOps/s | 33.4658 KOps/s | |
test_unbind_pytree | 61.2710μs | 29.1559μs | 34.2984 KOps/s | 34.8368 KOps/s | |
test_unbind_td | 0.7422ms | 37.2165μs | 26.8698 KOps/s | 27.5057 KOps/s | |
test_split_pytree | 65.3410μs | 30.7847μs | 32.4836 KOps/s | 32.2996 KOps/s | |
test_split_td | 0.9543ms | 40.8518μs | 24.4787 KOps/s | 24.5576 KOps/s | |
test_add_pytree | 72.2610μs | 34.8921μs | 28.6598 KOps/s | 27.5463 KOps/s | |
test_add_td | 97.1620μs | 48.0080μs | 20.8299 KOps/s | 20.5762 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1735ms | 0.1233ms | 8.1117 KOps/s | 7.9478 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2201ms | 0.1266ms | 7.9014 KOps/s | 7.7347 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1395ms | 97.9494μs | 10.2094 KOps/s | 9.9313 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.2886ms | 0.1501ms | 6.6608 KOps/s | 6.6369 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1621ms | 29.7963μs | 33.5612 KOps/s | 42.3799 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 65.6810μs | 27.5494μs | 36.2984 KOps/s | 36.7671 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2938ms | 66.0390μs | 15.1426 KOps/s | 15.1893 KOps/s | |
test_compile_copy_nested[pytree-eager] | 88.7610μs | 51.5865μs | 19.3849 KOps/s | 19.6963 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1897ms | 0.1440ms | 6.9456 KOps/s | 7.0135 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3009ms | 0.2106ms | 4.7490 KOps/s | 4.8261 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1358ms | 99.5475μs | 10.0455 KOps/s | 10.1312 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1087ms | 51.6660μs | 19.3551 KOps/s | 18.8968 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1861ms | 0.1381ms | 7.2415 KOps/s | 7.2877 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5304ms | 0.4788ms | 2.0886 KOps/s | 2.0569 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3684ms | 0.2511ms | 3.9826 KOps/s | 4.0108 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2066ms | 0.1448ms | 6.9071 KOps/s | 6.9432 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1418ms | 62.2211μs | 16.0717 KOps/s | 15.1530 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1417ms | 99.8203μs | 10.0180 KOps/s | 10.1239 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4492ms | 0.4035ms | 2.4780 KOps/s | 2.4285 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1861ms | 0.1348ms | 7.4180 KOps/s | 7.2575 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 50.2000μs | 19.2531μs | 51.9398 KOps/s | 51.3543 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 66.8310μs | 28.0226μs | 35.6855 KOps/s | 37.4314 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1033ms | 71.2474μs | 14.0356 KOps/s | 14.1827 KOps/s | |
test_compile_copy_flat[pytree-eager] | 80.7110μs | 52.5279μs | 19.0375 KOps/s | 19.4100 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6504ms | 0.3978ms | 2.5141 KOps/s | 2.2112 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7571ms | 2.6006ms | 384.5334 Ops/s | 374.0492 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6179ms | 0.4391ms | 2.2775 KOps/s | 2.2486 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8856ms | 2.6485ms | 377.5705 Ops/s | 382.9704 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1828ms | 0.1183ms | 8.4548 KOps/s | 8.6960 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5646ms | 83.1746μs | 12.0229 KOps/s | 12.4444 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1725ms | 0.1071ms | 9.3371 KOps/s | 9.4070 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1082ms | 67.2436μs | 14.8713 KOps/s | 14.2975 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1664ms | 0.1119ms | 8.9352 KOps/s | 9.3025 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1264ms | 70.4240μs | 14.1997 KOps/s | 14.5207 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1478ms | 0.1033ms | 9.6787 KOps/s | 9.4940 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1483ms | 16.9867μs | 58.8698 KOps/s | 55.1549 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2073ms | 97.5830μs | 10.2477 KOps/s | 10.2094 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 52.4700μs | 16.2219μs | 61.6450 KOps/s | 60.1562 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1497ms | 98.3406μs | 10.1687 KOps/s | 10.1471 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 51.9610μs | 16.0168μs | 62.4344 KOps/s | 60.3092 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1471ms | 0.1064ms | 9.3948 KOps/s | 9.7294 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5986ms | 16.7641μs | 59.6512 KOps/s | 55.3629 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1543ms | 0.1031ms | 9.7001 KOps/s | 10.1662 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 47.8210μs | 16.1230μs | 62.0233 KOps/s | 60.3895 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1410ms | 97.6983μs | 10.2356 KOps/s | 10.1645 KOps/s | |
test_compile_indexing[int-pytree-eager] | 57.4110μs | 16.0389μs | 62.3486 KOps/s | 60.8096 KOps/s | |
test_mod_add[eager] | 71.0510μs | 31.5734μs | 31.6722 KOps/s | 31.4615 KOps/s | |
test_mod_add[compile] | 0.1253ms | 79.4239μs | 12.5907 KOps/s | 12.6180 KOps/s | |
test_mod_add[compile-overhead] | 0.3349ms | 0.1710ms | 5.8479 KOps/s | 5.7436 KOps/s | |
test_mod_wrap[eager] | 0.3225ms | 0.2412ms | 4.1456 KOps/s | 4.0749 KOps/s | |
test_mod_wrap[compile] | 1.6065ms | 0.2862ms | 3.4939 KOps/s | 3.4010 KOps/s | |
test_mod_wrap[compile-overhead] | 6.9801ms | 3.7171ms | 269.0247 Ops/s | 267.6315 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6607ms | 1.4773ms | 676.8950 Ops/s | 703.2336 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3981ms | 1.2744ms | 784.7023 Ops/s | 725.8378 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4062ms | 0.9319ms | 1.0731 KOps/s | 964.5683 Ops/s | |
test_seq_add[eager] | 0.2057ms | 96.4828μs | 10.3645 KOps/s | 9.5215 KOps/s | |
test_seq_add[compile] | 0.1571ms | 88.3307μs | 11.3211 KOps/s | 11.0874 KOps/s | |
test_seq_add[compile-overhead] | 0.1954ms | 0.1363ms | 7.3376 KOps/s | 7.7300 KOps/s | |
test_seq_wrap[eager] | 0.5504ms | 0.3799ms | 2.6323 KOps/s | 2.4561 KOps/s | |
test_seq_wrap[compile] | 0.3487ms | 0.3007ms | 3.3256 KOps/s | 3.2602 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3155ms | 0.2334ms | 4.2841 KOps/s | 4.4278 KOps/s | |
test_func_call_runtime[False-eager] | 0.9107ms | 0.7838ms | 1.2758 KOps/s | 1.2540 KOps/s | |
test_func_call_runtime[False-compile] | 0.9991ms | 0.7827ms | 1.2777 KOps/s | 1.3206 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4130ms | 0.3676ms | 2.7203 KOps/s | 2.7255 KOps/s | |
test_func_call_runtime[True-eager] | 1.0352ms | 0.8997ms | 1.1114 KOps/s | 1.1059 KOps/s | |
test_func_call_runtime[True-compile] | 0.8413ms | 0.7656ms | 1.3062 KOps/s | 1.2952 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4353ms | 0.3898ms | 2.5656 KOps/s | 2.5879 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8118ms | 0.7368ms | 1.3573 KOps/s | 1.3002 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.1188ms | 0.7608ms | 1.3143 KOps/s | 1.3175 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4268ms | 0.3702ms | 2.7011 KOps/s | 2.7064 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0997ms | 0.9951ms | 1.0049 KOps/s | 986.5199 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8862ms | 0.7961ms | 1.2561 KOps/s | 1.2324 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4657ms | 0.4171ms | 2.3974 KOps/s | 2.4077 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5906ms | 2.0842ms | 479.8072 Ops/s | 477.8236 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8809ms | 0.8084ms | 1.2370 KOps/s | 1.1899 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4568ms | 0.4153ms | 2.4080 KOps/s | 2.4096 KOps/s | |
test_distributed | 2.8720ms | 0.2227ms | 4.4910 KOps/s | 8.8720 KOps/s | |
test_tdmodule | 26.7300μs | 14.1270μs | 70.7865 KOps/s | 71.2420 KOps/s | |
test_tdmodule_dispatch | 0.3043ms | 28.6272μs | 34.9318 KOps/s | 37.1927 KOps/s | |
test_tdseq | 33.3800μs | 15.3617μs | 65.0971 KOps/s | 65.2183 KOps/s | |
test_tdseq_dispatch | 50.5610μs | 31.4624μs | 31.7839 KOps/s | 33.3841 KOps/s | |
test_instantiation_functorch | 1.6509ms | 1.5730ms | 635.7269 Ops/s | 635.9201 Ops/s | |
test_exec_functorch | 0.1825ms | 0.1463ms | 6.8368 KOps/s | 6.5779 KOps/s | |
test_exec_functional_call | 0.1890ms | 0.1409ms | 7.0992 KOps/s | 6.8698 KOps/s | |
test_exec_td_decorator | 0.3757ms | 0.1862ms | 5.3714 KOps/s | 5.2355 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7974ms | 0.6742ms | 1.4831 KOps/s | 1.4828 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8726ms | 0.6725ms | 1.4871 KOps/s | 1.4836 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6976ms | 0.5911ms | 1.6919 KOps/s | 1.6790 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6998ms | 0.5936ms | 1.6846 KOps/s | 1.6783 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.1964ms | 19.1202ms | 52.3006 Ops/s | 52.2491 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.8021ms | 19.2066ms | 52.0653 Ops/s | 52.1819 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.1409ms | 19.0703ms | 52.4377 Ops/s | 52.3623 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.1013ms | 19.0176ms | 52.5830 Ops/s | 52.0649 Ops/s | |
test_to_module_speed[True] | 1.0553ms | 0.9448ms | 1.0584 KOps/s | 1.0560 KOps/s | |
test_to_module_speed[False] | 1.3755ms | 0.9399ms | 1.0640 KOps/s | 1.0791 KOps/s | |
test_tc_init | 61.4610μs | 33.5841μs | 29.7760 KOps/s | 27.8993 KOps/s | |
test_tc_init_nested | 0.1036ms | 71.8143μs | 13.9248 KOps/s | 13.8094 KOps/s | |
test_tc_first_layer_tensor | 10.5316μs | 0.7515μs | 1.3306 MOps/s | 1.3338 MOps/s | |
test_tc_first_layer_nontensor | 24.6210μs | 2.4136μs | 414.3217 KOps/s | 407.7908 KOps/s | |
test_tc_second_layer_tensor | 11.6133μs | 1.5479μs | 646.0347 KOps/s | 653.3888 KOps/s | |
test_tc_second_layer_nontensor | 0.1317ms | 3.2176μs | 310.7884 KOps/s | 310.7626 KOps/s | |
test_unbind | 0.2252s | 9.8167ms | 101.8669 Ops/s | 98.3818 Ops/s | |
test_full_like | 10.1271ms | 9.4765ms | 105.5242 Ops/s | 103.4456 Ops/s | |
test_zeros_like | 9.5422ms | 7.2916ms | 137.1433 Ops/s | 136.6968 Ops/s | |
test_ones_like | 4.9472ms | 4.1746ms | 239.5428 Ops/s | 227.4156 Ops/s | |
test_clone | 7.7082ms | 6.5736ms | 152.1230 Ops/s | 144.1428 Ops/s | |
test_squeeze | 59.5300μs | 9.6850μs | 103.2526 KOps/s | 103.2331 KOps/s | |
test_unsqueeze | 0.1191ms | 71.1493μs | 14.0549 KOps/s | 14.0219 KOps/s | |
test_split | 0.4173ms | 0.1553ms | 6.4409 KOps/s | 6.2813 KOps/s | |
test_permute | 0.2243ms | 0.1749ms | 5.7184 KOps/s | 5.6075 KOps/s | |
test_stack | 53.7709ms | 51.5892ms | 19.3839 Ops/s | 19.2069 Ops/s | |
test_cat | 52.0024ms | 51.2311ms | 19.5194 Ops/s | 19.3734 Ops/s |
vmoens added the enhancement (New feature or request) and Refactor (Refactoring code - not a new feature) labels on Nov 22, 2024
Stack from ghstack (oldest at bottom):