Full Publications/Events (14)

2023 (8)

Blog published on Medium: Faster Stable Diffusion Inference with Intel Extension for Transformers (July 2023)
Blog of Intel Developer News: The Moat Is Trust, Or Maybe Just Responsible AI (July 2023)
Blog of Intel Developer News: Create Your Own Custom Chatbot (July 2023)
Blog of Intel Developer News: Accelerate Llama 2 with Intel AI Hardware and Software Optimizations (July 2023)
Arxiv: An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs (June 2023)
Blog published on Medium: Simplify Your Custom Chatbot Deployment (June 2023)
Blog published on Medium: Create Your Own Custom Chatbot (April 2023)
Blog of Tech-Innovation Artificial-Intelligence(AI): Intel® Xeon® Processors Are Still the Only CPU With MLPerf Results, Raising the Bar By 5x - Intel Communities (April 2023)

2022 (5)

Blog published on Medium: MLefficiency — Optimizing transformer models for efficiency (Dec 2022)
NeurIPS'2022: Fast Distilbert on CPUs (Nov 2022)
NeurIPS'2022: QuaLA-MiniLM: a Quantized Length Adaptive MiniLM (Nov 2022)
Blog published by Cohere: Top NLP Papers—November 2022 (Nov 2022)
Blog published by Alibaba: Deep learning inference optimization for Address Purification (Aug 2022)

2021

NeurIPS'2021: Prune Once for All: Sparse Pre-Trained Language Models (Nov 2021)