Fast path slice contiguous constant #137

Merged
wsmoses merged 2 commits into main from fpslice
Oct 8, 2024

Conversation

@wsmoses wsmoses commented Oct 8, 2024

No description provided.

@wsmoses wsmoses merged commit 93655ef into main Oct 8, 2024
4 of 10 checks passed
@wsmoses wsmoses deleted the fpslice branch October 8, 2024 03:51
smjleo added a commit that referenced this pull request Oct 18, 2024
* Simplify path infra (#128)

* Simplify path infra

* fmt

* add utils

* Bump internals (#130)

* Bump internals

* fix

* fixup

* fix

* get out hlomodule from wrapper module

* construct analysis with test gpu specs

and broken dlopen because some random dependency found its way in

* fix build

* actually compute non-zero costs

* use analytical cost model

* dedup dependency for gpu

remove debug output

* Convolution support

* adding more multi patterns

* Attempt gpu ci fix (#125)

* Try local python usage

* Attempt gpu ci fix

* fix

* fix

* fix

* Update llama.py

* try cuda 12 pip

* fix

* fix libdevice

* cleanup

* Force gpu

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update test_utils.py

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update gpu_pipeline.yml

* Update test_utils.py

---------

Co-authored-by: William Moses <wsmoses@cyclops.juliacomputing.io>

* formatting and fix multi pattern rewrite

* add debug outputs

* try inferring device description (broken on cpu, maybe it works on gpu)

* remove debug output for device desc

* clean up cost model, and make it depend on platform

* Adding resnet test and eqsat after jvp

* hugging face transformers vision transformer and resnet added

* Fast path slice contiguous constant (#137)

* Fast path slice contiguous constant

* fmt

* remove unneeded code in resnet

* fixed conv rewrite

* fixed next multi-pattern for conv

* turn off fusion costs by default

* build python 3.11 instead

* add batched matmul multi-pattern rewrite

* merging rewrites

* Adding back deleted

* bert huggingface added

* add matmul multi-pattern rewrite with different lhs contracting

* Add jaxmd tests (#136)

* Add jaxmd tests

* fix

* fix

* fix

* fix

* fixup

* fmt

* More ops

* fix

* fixup

* fixup

* fix

* cleanup

* fix

* fix

* fix

* fmt

* fix 312

* add reduce test

* red

* now as test

* fix

* cleanup

* jaxmd disable AD

* remove extra print

* no tpu

* Scatter activity test

* fix

* fix

* fix

* cleanup

* cleanup

* dynamic slice fwd

* dynamic update slice fwd

* Fix

* remove pyc

* add pyc

* fix

* fix

* fix

* fix

* fix

* fastsmpl

* fix

* cleanup

* cleanup

* fix

* final fix

* fix broken asserts

* fix contracting 2 rewrite

* adding mistral test

* Transpose batch (#138)

* ci: touch lock file

* try fixing ci

* turn off visibility checks

* try removing dependency?

* try adding redzone allocator

* changes

* try adding config cuda

* bump internals

* bump again

* use python 3.12

* remove crosstool in cuda config

* Maxtext (#139)

* Maxtext

* fixup

* fix

* try fix

* fix

* fix

* cleanup

* Update test-requirements.txt

* cleanup

* don't build rocm

* bring back crosstool

* add a bunch of bazel flags

* turn off tf nvcc clang

* device description fix

* update cuda/cudnn versions

* try printing libs

* revert

* does nvrtc work as deps?

* expose multi pattern rules

* put python in path

* add ortools as dep

* add eqsat env var for test

* ortools where are you???

* descent into madness

* please

* Update StableHLOAutoDiffOpInterfaceImpl.cpp

* a

* cleanup, and hopefully make it build on cyclops again

* get results csv from the right place

* add eqsat to maxtext test

* csv location

* get csv

---------

Co-authored-by: William Moses <gh@wsmoses.com>
Co-authored-by: jbachurski <kbachurski@gmail.com>
Co-authored-by: Arya Vohra <arya.vohra04@gmail.com>
Co-authored-by: William Moses <wsmoses@cyclops.juliacomputing.io>