<abbr id="y2asm"></abbr><abbr id="y2asm"></abbr>
  • <code id="y2asm"></code>
    <code id="y2asm"></code>
  • <button id="y2asm"></button>
    <rt id="y2asm"></rt>

    Absolute Zero: Reinforced Self-play Reasoning with Zero Data

    LeapLabTHU/Absolute-Zero-Reasoner ? ? 6 May 2025

    Reinforcement learning with verifiable rewards (RLVR) has shown promise in enhancing the reasoning capabilities of large language models by learning directly from outcome-based rewards.

    Mathematical Reasoning

    385
    4.12 stars / hour

    PixelHacker: Image Inpainting with Structural and Semantic Consistency

    hustvl/PixelHacker ? 29 Apr 2025

    Specifically, we first construct a large dataset containing 14 million image-mask pairs by annotating foreground and background (potential 116 and 21 categories, respectively).

    Denoising Facial Inpainting

    245
    1.95 stars / hour

    Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

    maitrix-org/voila ? ? 5 May 2025

    A voice AI agent that blends seamlessly into daily life would interact with humans in an autonomous, real-time, and emotionally expressive manner.

    AI Agent Automatic Speech Recognition +4

    233
    1.82 stars / hour

    Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

    aidc-ai/awesome-unified-multimodal-models ? ? 5 May 2025

    Despite their respective successes, these two domains have evolved independently, leading to distinct architectural paradigms: While autoregressive-based architectures have dominated multimodal understanding, diffusion-based models have become the cornerstone of image generation.

    Survey Text-to-Image Generation

    79
    1.79 stars / hour

    LTX-Video: Realtime Video Latent Diffusion

    Lightricks/LTX-Video ? ? 30 Dec 2024

    To address this, our VAE decoder is tasked with both latent-to-pixel conversion and the final denoising step, producing the clean result directly in pixel space.

    Denoising Image to Video Generation

    4,360
    1.52 stars / hour

    WebThinker: Empowering Large Reasoning Models with Deep Research Capability

    ruc-nlpir/webthinker ? 30 Apr 2025

    Large reasoning models (LRMs), such as OpenAI-o1 and DeepSeek-R1, demonstrate impressive long-horizon reasoning capabilities.

    Navigate

    614
    1.33 stars / hour

    FastVLM: Efficient Vision Encoding for Vision Language Models

    apple/ml-fastvlm ? ? 17 Dec 2024

    At different operational resolutions, the vision encoder of a VLM can be optimized along two axes: reducing encoding latency and minimizing the number of visual tokens passed to the LLM, thereby lowering overall latency.

    227
    1.28 stars / hour

    Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents

    simular-ai/agent-s ? ? 1 Apr 2025

    Computer use agents automate digital tasks by directly interacting with graphical user interfaces (GUIs) on computers and mobile devices, offering significant potential to enhance human productivity by completing an open-ended space of user queries.

    AI Agent Task Planning

    4,514
    1.24 stars / hour

    LiftFeat: 3D Geometry-Aware Local Feature Matching

    lyp-deeplearning/liftfeat ? ? 6 May 2025

    We then design a 3D geometry-aware feature lifting module to fuse surface normal feature with raw 2D descriptor feature.

    3D geometry Homography Estimation +3

    63
    1.23 stars / hour

    PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

    facebookresearch/perception_models ? ? 17 Apr 2025

    In this paper, we study building a Perception Language Model (PLM) in a fully open and reproducible framework for transparent research in image and video understanding.

    Video Question Answering Video Understanding

    961
    0.98 stars / hour
    主站蜘蛛池模板: 国产日韩一区二区三区| 五月婷婷在线视频| 国产成人综合美国十次| 国产鲁鲁视频在线观看| 天堂在线www资源在线下载| 无人在线观看视频高清视频8| 樱桃视频直播在线观看免费 | 日韩精品一区二区三区老鸭窝| www.nxgx| 中文字幕a∨在线乱码免费看| 久久精品人人爽人人爽| 亚洲欧美成人中文日韩电影| 伊人久久大香线蕉综合影院首页 | 狠狠色噜噜狠狠狠狠69| 色妞WW精品视频7777| 裸体跳舞XXXX裸体跳舞| 香蕉久久综合精品首页| 精品丝袜国产自在线拍亚洲| 黄色一级片在线播放| 鸥美一级黄色片| 色吊丝最新网站| 美女裸身正面无遮挡全身视频| 成人精品一区二区户外勾搭野战| 天堂网在线资源www最新版| 91在线亚洲综合在线| 777奇米四色| 2015天堂网| 亚洲国产老鸭窝一区二区三区| 77777亚洲午夜久久多喷| 91精品国产自产91精品| 337p日本欧洲亚洲大胆精品555588 | 色综合天天综合网站中国| 色综合小说久久综合图片| 老司机在线精品视频| 精品欧美军人同性videos| 精品久久久久久亚洲综合网| 瓮红电影三级在线播放| 欧美黑人乱大交| 欧美人善交videosg| 日韩精品人妻系列无码av东京| 日本精品少妇一区二区三区|