Skip to content
View yang-su2000's full-sized avatar
👀
You found me!
👀
You found me!

Block or report yang-su2000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yang-su2000/README.md

Welcome to Yang's GitHub

Hi there! I am currently working on LLMs alignment & agent systems at Qwen Team. Some topics I currently focus on

  • structured outputs & complex instruction following
  • personalization & user preference learning
  • agent alignment & self-play algorithms

I am happy to chat and discuss potential collaborations, feel free to reach out by

Linkedin Twitter Gmail WeChat

🌟 Studying Zone

(2024-) I am part-time collaborating with Cornell ICPC and Millennium to build LLMs for code and data generation.

  • This work is called ALICE (Aligning Language models for Interactive Code Execution), find more about it at alicellm.github.io.
  • ALICE is a meta-agent collaboration system that generates high-quality data through multi-turn interactions and feedback without human intervention.
  • It produces multimodal data with traces from agent strategies like ReAct and Reflexion, which are scarce but offer potential for aligning advanced LLMs.

(2023-2024) I I led the prior work of ALICE called Voice2Action with Cornell XRC, an Unity Package for real-time code execution in VR; and studied on large-scale generation augmented retrieval systems (opposed to RAG) at Cornell NLP.

(2021-2022) I interned on graph machine learning at AWS AI Lab and contributed to the open source Deep Graph Library.

👀 Chilling Zone

I like programming! I lead the "Cornell Tech" Group at Cornell ICPC and won the Top 20% in 2023 Regional!

LeetCode CodeForces Visitors

I enjoy cooking, listening to music of all forms, playing ping-pong, reading science fiction, and more!

⚡ Developing Zone

📈 "Accepted" Zone

Pinned Loading

  1. Voice2Action Voice2Action Public

    ALICE and its prior work, Voice2Action: Language Models as Agent for Efficient Real-Time Interaction in Virtual Reality

    C# 33 3

  2. RageAgainstThePixel/com.openai.unity RageAgainstThePixel/com.openai.unity Public

    A Non-Official OpenAI Rest Client for Unity (UPM)

    C# 484 66

  3. boson-ai/RPBench-Auto boson-ai/RPBench-Auto Public

    An automated pipeline for evaluating LLMs for role-playing.

    Python 147 8

  4. luyug/GradCache luyug/GradCache Public

    Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

    Python 368 24

  5. Authorship-Identification-with-NLP Authorship-Identification-with-NLP Public

    Large-scale user portarit ranking and generation augmented retrieval systems.

    Jupyter Notebook 5 1

  6. dmlc/dgl dmlc/dgl Public

    Python package built to ease deep learning on graph, on top of existing DL frameworks.

    Python 13.6k 3k