TencentARC/TimeLens-8B
Video-Text-to-Text
•
9B
•
Updated
•
230
•
4
ARC mainly focuses on areas of computer vision, speech, and natural language processing, including speech/video generation, enhancement, retrieval, understanding, AutoML, etc. Considering research developments and industry trends, ARC consistently pursues exploration, innovation, and breakthroughs in technologies.
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs