What’s Happening in My Field
Flow Matching
StainFlow: Entity-Stain Tracking and Evidence Linking for Process Rewards in GUI Agents
2026-06-05
Audio Reasoning
VISA: A Visual Information Strengthened Audio-Reasoning System for the Interspeech 2026 ARC Agent Track
2026-06-05
RL Training
Cross-Epoch Adaptive Rollout Optimization for RL Post-Training
2026-06-04
RL Training
OpenSkill: Open-World Self-Evolution for LLM Agents
2026-06-04
Reasoning
Test-Time Compute Scaling for ASR with Depth-Conditioned Looped Transformers
2026-06-03
Reasoning
SCI-PRM: A Tool Aware Process Reward Model for Scientific Reasoning Verification
2026-06-03
Flow Matching
FlowPRO: Reward-Free Reinforced Fine-Tuning of Flow-Matching VLAs via Proximalized Preference Optimization
2026-06-03
Multimodal
Food-R1: A Unified Multi-Task Food Vision-Language Model with Reinforcement Learning
2026-06-03
RL Training
Libra: Efficient Resource Management for Agentic RL Post-Training
2026-06-02