Skip to content
View Galleons2029's full-sized avatar
  • Wuhan University
  • Beijing

Highlights

  • Pro

Block or report Galleons2029

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Galleons2029/README.md

💫 About Me:

🔭 I'm currently working on LLM post-training for long-horizon reasoning and agentic tasks.
🌱 I'm currently learning inference acceleration and systems-level optimization (CUDA, vLLM, SGLang).
👯 I'm looking to collaborate on open-source AI infra projects (vLLM-Omni, SGLang RL/Diffusion).
🤔 I'm looking for help with scaling distributed RL training for large-scale agent benchmarks.
📫 How to reach me: Galleons777@gmail.com

💻 Tech Stack:

Python Go Rust FastAPI Next JS Redis Postgres RabbitMQ PyTorch

✍️ Random Dev Quote

Pinned Loading

  1. vllm-omni-ljl vllm-omni-ljl Public

    Forked from vllm-project/vllm-omni

    A framework for efficient model inference with omni-modality models

    Python

  2. verl-omni-ljl verl-omni-ljl Public

    Forked from verl-project/verl-omni

    RL training framework for diffusion and omni-modality models

    Python

  3. Cascade-RAG Cascade-RAG Public

    Enterprise-level Reasoning RAG

    Python 5 1

  4. Relax-ljl Relax-ljl Public

    Forked from redai-infra/Relax

    An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

    Python

  5. CS336-sp25 CS336-sp25 Public

    CS336-sp25 class project

    Python

  6. verl_multi_turn verl_multi_turn Public

    Forked from verl-project/verl

    verl: Volcano Engine Reinforcement Learning for LLMs

    Python