- Currently, all of these models would have been converted with openelm-coreml.py
- Review ops, layers, precision for a model
- Review apple/ml-recurrent-drafter
- modeling_llama.py
- this model also appears to be an ANE-optimized Llama, with the ANE principles implemented
- lines 161 and 162 deal with key_states and value_states
- class LlamaAttention
- key_states = torch.repeat_interleave(key_states, dim=1, repeats=self.n_kv_groups)
- value_states = torch.repeat_interleave(value_states, dim=1, repeats=self.n_kv_groups)
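The two lines above are the standard grouped-query attention (GQA) trick: the model has fewer key/value heads than query heads, and `repeat_interleave` along the head dimension copies each KV head so the tensors line up head-for-head with the queries. A minimal sketch, with hypothetical shapes (the actual head counts come from the model config, not from this note):

```python
import torch

# Hypothetical sizes for illustration; the real values come from the config.
batch, n_kv_heads, seq_len, head_dim = 1, 4, 8, 64
n_heads = 32                          # query heads
n_kv_groups = n_heads // n_kv_heads   # 8 query heads share each KV head

key_states = torch.randn(batch, n_kv_heads, seq_len, head_dim)
value_states = torch.randn(batch, n_kv_heads, seq_len, head_dim)

# repeat_interleave along dim=1 copies each KV head n_kv_groups times,
# expanding (b, n_kv_heads, s, d) -> (b, n_heads, s, d).
key_states = torch.repeat_interleave(key_states, dim=1, repeats=n_kv_groups)
value_states = torch.repeat_interleave(value_states, dim=1, repeats=n_kv_groups)

print(key_states.shape)  # torch.Size([1, 32, 8, 64])
```

Note that `repeat_interleave` materializes the copies, which is simpler to express for CoreML/ANE conversion than the `expand`/`reshape` pattern some Hugging Face implementations use.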
- Review chunk_mlprogram.py (changed from apple/ml-stable-diffusion)
- Optimize for chunking text LLMs
- needs to check PSNR (chunked pipeline output vs. the original model)
- the random_gen_input_feature_type func is not working: because of how the model was converted, it does not properly expose a value type, so the func cannot tell how to generate those input features (this seems to be the issue)
- the program itself does work
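The PSNR check mentioned above compares the stitched chunked pipeline's outputs against the original model's outputs to catch numerical drift from splitting. A minimal sketch of such a check, in plain NumPy (the function name and thresholds here are assumptions, not taken from chunk_mlprogram.py):

```python
import numpy as np

def psnr(reference: np.ndarray, test: np.ndarray) -> float:
    """Peak signal-to-noise ratio in dB between two output tensors.

    Hypothetical helper: compare the unchunked model's output against
    the chunked pipeline's output; a high value means the chunks match.
    """
    mse = np.mean((reference.astype(np.float64) - test.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")
    peak = np.max(np.abs(reference))
    return 20.0 * np.log10(peak / np.sqrt(mse))

# Illustrative usage with synthetic tensors standing in for model outputs.
ref = np.random.randn(1, 128).astype(np.float32)
chunked_out = ref + np.random.randn(1, 128).astype(np.float32) * 1e-3
print(psnr(ref, chunked_out))  # small perturbation -> high PSNR
```

A threshold around 40 dB is a common sanity bar for "numerically close enough" after chunking, but the appropriate cutoff depends on the model and precision.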
- The differences between the two inspection tools: how they get info, how they display it, and the environment packages they require
- smpanaro/CoreMLInspect
- this works in basically any environment
- layer-iteration.py
- this requires an environment similar to the ml-explore/mlx-examples env
- due to a missing PIL package, I had issues running it in my Python venv
- OpenELM-270M-Instruct
- OpenELM-1B-Instruct (may not be converted; have to determine whether the failure is a RAM or storage issue)