🔭 I'm currently working on LLM post-training for long-horizon reasoning and agentic tasks.
🌱 I'm currently learning inference acceleration and systems-level optimization (CUDA, vLLM, SGLang).
👯 I'm looking to collaborate on open-source AI infra projects (vLLM-Omni, SGLang RL/Diffusion).
🤔 I'm looking for help with scaling distributed RL training for large-scale agent benchmarks.
📫 How to reach me: Galleons777@gmail.com
Pinned
- vllm-omni-ljl (Public, forked from vllm-project/vllm-omni)
  A framework for efficient model inference with omni-modality models
  Python
- verl-omni-ljl (Public, forked from verl-project/verl-omni)
  RL training framework for diffusion and omni-modality models
  Python
- Relax-ljl (Public, forked from redai-infra/Relax)
  An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale
  Python
- verl_multi_turn (Public, forked from verl-project/verl)
  verl: Volcano Engine Reinforcement Learning for LLMs
  Python


