auto-tuning

Here are 30 public repositories matching this topic...

ROCm / Tensile

Stretching GPU performance for GEMMs and tensor contractions.

python machine-learning amd gpu assembly opencl dnn matrix-multiplication neural-networks gpu-acceleration blas hip gpu-computing tensors tensor-contraction gemm radeon auto-tuning

Updated Nov 12, 2024
Python

intel / neural-compressor

Star

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

sparsity pruning quantization knowledge-distillation auto-tuning int8 low-precision quantization-aware-training post-training-quantization awq int4 large-language-models gptq smoothquant sparsegpt fp4 mxformat

Updated Nov 12, 2024
Python

KernelTuner / kernel_tuner

Star

Kernel Tuner

python c testing machine-learning cplusplus gpu optimization opencl cuda autotuning software-development opencl-kernels kernel-tuner cuda-kernels gpu-computing auto-tuning

Updated Nov 12, 2024
Python

Pool Manager dirancang untuk mengelola pooling objek secara efisien dalam aplikasi Anda. Dengan fitur-fitur seperti sharding, caching, auto-tuning, dan kebijakan eviksi, package ini membantu meningkatkan performa dan efisiensi penggunaan memori.

go caching performance concurrency memory-management auto-tuning pooling eviction-policy shrading

Updated Oct 27, 2024
Go

oracle / bpftune

Star

bpftune uses BPF to auto-tune Linux systems

linux ebpf bpf auto-tuning

Updated Oct 25, 2024
C

zwang4 / awesome-machine-learning-in-compilers

Star

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

machine-learning compiler parallel-computing parallelism artificial-intelligence operating-systems optimisation auto-tuning parallel-programming parallelisation multi-cores

Updated Oct 14, 2024

AutoTuningAssociation / autotuning_methodology

Star

This software package accompanies the paper "A Methodology for Comparing Auto-Tuning Optimization Algorithms" (https://doi.org/10.1016/j.future.2024.05.021), making the guidelines in the methodology easy to apply.

performance-metrics methodology optimization-algorithms auto-tuning performance-comparison performance-optimization

Updated Nov 7, 2024
Python

HAL-42 / AlchemyCat

Star

Alchemy Cat —— 🔥Config System for SOTA

config machine-learning computer-vision deep-learning auto-tuning parameter-tuning

Updated Aug 19, 2024
Python

NTNU-HPC-Lab / BAT

Star

A GPU benchmark suite for autotuners

benchmarking kernel hpc cuda autotuning bat benchmark-suite auto-tuning

Updated Feb 20, 2024
Cuda

addb-swstarlab / K2vTune

Star

K2vTune (A Workload-aware Configuration Tuning for RocksDB)

rocksdb machine-learning artificial-intelligence auto-tuning configuration-tuning

Updated Nov 15, 2023
Jupyter Notebook

kilitary / fann-related

Star

fann networks+forex + MILK + met8

js com forex rprop bat arj auto-tuning fann quickprop fann-networks lzh

Updated Aug 5, 2023
HTML

umayrh / sparktuner

Star

Autotuner for Spark applications

python spark apache-spark tuning tuning-parameters auto-tuning

Updated May 22, 2023
Python

polycloze / polycloze

Star

A self-hosted language learning website

flashcards language-learning vocabulary self-hosted spaced-repetition srs tatoeba auto-tuning vocabulary-learning language-course cloze-tests

Updated Jan 11, 2023
Go

CNugteren / CLTune

Sponsor

Star

CLTune: An automatic OpenCL & CUDA kernel tuner

opencl cuda tuner auto-tuning

Updated Dec 12, 2022
C++

tlc-pack / TLCBench

Star

Benchmark scripts for TVM

benchmark deep-learning auto-tuning tvm tuning-logs

Updated Mar 15, 2022
Python

sbu-fsl / kernel-ml

Star

Machine Learning Framework for Operating Systems - Brings ML to Linux kernel

machine-learning kernel-module linux-kernel operating-systems auto-tuning mlsys

Updated Dec 13, 2021
C

cornell-zhang / uptune

Star

A Generic Distributed Auto-Tuning Infrastructure

python distributed-systems cpp heuristics auto-tuning

Updated Jul 29, 2021
Python

ctuning / ck-crowdtuning

Sponsor

Star

Collective Knowledge crowd-tuning extension to let users crowdsource their experiments (using portable Collective Knowledge workflows) such as performance benchmarking, auto tuning and machine learning across diverse platforms with Linux, Windows, MacOS and Android provided by volunteers. Demo of DNN crowd-benchmarking and crowd-tuning:

iot machine-learning optimization collaboration collective-intelligence mobile-phones knowledge-sharing collective-knowledge auto-tuning mobile-devices crowdsource-experiments crowd-tuning crowd-benchmarking open-repository

Updated Jul 10, 2021
Python

SUSE / phoebe

Star

Phoebe

linux machine-learning artificial-intelligence systems self-healing auto-tuning

Updated May 24, 2021
C

weixingsun / jBProF

Star

ebpf profiler for jvm

profiler jvm latency breakpoint perf flamegraph jni ebpf bpf jvmti auto-tuning

Updated May 5, 2021
C++

Improve this page

Add a description, image, and links to the auto-tuning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the auto-tuning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

auto-tuning

Here are 30 public repositories matching this topic...

ROCm / Tensile

intel / neural-compressor

KernelTuner / kernel_tuner

hibbannn / pool-manager

oracle / bpftune

zwang4 / awesome-machine-learning-in-compilers

AutoTuningAssociation / autotuning_methodology

HAL-42 / AlchemyCat

NTNU-HPC-Lab / BAT

addb-swstarlab / K2vTune

kilitary / fann-related

umayrh / sparktuner

polycloze / polycloze

CNugteren / CLTune

tlc-pack / TLCBench

sbu-fsl / kernel-ml

cornell-zhang / uptune

ctuning / ck-crowdtuning

SUSE / phoebe

weixingsun / jBProF

Improve this page

Add this topic to your repo