arxiv:2601.02151
diaomuxi
diaomuxi
AI & ML interests
LLM & MLLM
Recent Activity
upvoted
a
paper
about 21 hours ago
EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis
authored
a paper
5 days ago
CineTechBench: A Benchmark for Cinematographic Technique Understanding
and Generation
authored
a paper
5 days ago
OJBench: A Competition Level Code Benchmark For Large Language Models