Trending Research

Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

jennyzzt/dgm • 29 May 2025

The Gödel machine proposed a theoretical alternative: a self-improving AI that repeatedly modifies itself in a provably beneficial manner.

Meta-Learning

1,050

2.04 stars / hour

Paper
Code

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

inclusionai/areal • 30 May 2025

Most existing large-scale RL systems for LLMs are synchronous, alternating generation and training in a batch setting where rollouts in each training batch are generated by the same model.

Math Reinforcement Learning (RL)

1,562

1.25 stars / hour

Paper
Code

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

paper2poster/paper2poster • 27 May 2025

To address this challenge, we introduce the first benchmark and metric suite for poster generation, which pairs recent conference papers with author-designed posters and evaluates outputs on (i) Visual Quality: semantic alignment with human posters, (ii) Textual Coherence: language fluency, (iii) Holistic Assessment: six fine-grained aesthetic and informational criteria scored by a VLM-as-judge, and notably (iv) PaperQuiz: the poster's ability to convey core paper content as measured by VLMs answering generated quizzes.

1,796

1.05 stars / hour

Paper
Code

BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model

bowang-lab/bioreason • 29 May 2025

Unlocking deep, interpretable biological reasoning from complex genomic data is a major AI challenge hindering scientific discovery.

Large Language Model scientific discovery

152

0.88 stars / hour

Paper
Code

Emerging Properties in Unified Multimodal Pretraining

ByteDance-Seed/Bagel • 20 May 2025

Unifying multimodal understanding and generation has shown impressive capabilities in cutting-edge proprietary systems.

Image Manipulation multimodal generation +1

3,861

0.86 stars / hour

Paper
Code

HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters

tencent-hunyuan/hunyuanvideo-avatar • 26 May 2025

This ensures the dynamic motion and strong character consistency; (ii) An Audio Emotion Module (AEM) is introduced to extract and transfer the emotional cues from an emotion reference image to the target generated video, enabling fine-grained and accurate emotion style control; (iii) A Face-Aware Audio Adapter (FAA) is proposed to isolate the audio-driven character with latent-level face mask, enabling independent audio injection via cross-attention for multi-character scenarios.

Human Animation

986

0.81 stars / hour

Paper
Code

Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

DreamTechAI/Direct3D-S2 • 23 May 2025

Generating high-resolution 3D shapes using volumetric representations such as Signed Distance Functions (SDFs) presents substantial computational and memory challenges.

3D Generation 3D geometry +5

551

0.75 stars / hour

Paper
Code

RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

microsoft/renderformer • 28 May 2025

We present RenderFormer, a neural rendering pipeline that directly renders an image from a triangle-based representation of a scene with full global illumination effects and that does not require per-scene training or fine-tuning.

Neural Rendering

460

0.74 stars / hour

Paper
Code

AlphaEvolve: A Learning Framework to Discover Novel Alphas in Quantitative Investment

codelion/openevolve • 30 Mar 2021

In this paper, we introduce a new class of alphas to model scalar, vector, and matrix features which possess the strengths of these two existing classes.

AutoML Stock Prediction

2,263

0.71 stars / hour

Paper
Code

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

charlesq9/alita • 26 May 2025

For Maximal self-evolution, we enable the creativity of Alita by providing a suite of general-purpose components to autonomously construct, refine, and reuse external capabilities by generating task-related model context protocols (MCPs) from open source, which contributes to scalable agentic reasoning.

398

0.69 stars / hour

Paper
Code