Ipex vllm mainline #12157

Open
wants to merge 6 commits into base: main

Commits on Aug 9, 2024

  1. Enable single card sync engine

    gc-fu committed Aug 9, 2024
    cf3f33b
  2. 232444e
  3. 95bbb29
  4. 8d700fc
  5. Remove duplicate layer

    gc-fu committed Aug 9, 2024
    1296e5a
  6. LLM: Update vLLM to v0.5.4 (intel-analytics#11746)

    * Enable single card sync engine
    
    * enable ipex-llm optimizations for vllm
    
    * enable optimizations for lm_head
    
    * Fix chatglm multi-reference problem
    
    * update 0.5.4 api_server
    
    * add dockerfile
    
    * fix
    
    * fix
    
    * refine
    
    * fix
    
    ---------
    
    Co-authored-by: gc-fu <guancheng.fu@intel.com>
    xiangyuT and gc-fu authored Aug 9, 2024
    8dc1ea0