Trending Research

Monolith: Real Time Recommendation System With Collisionless Embedding Table

bytedance/monolith • 16 Sep 2022

In this paper, we present Monolith, a system tailored for online training.

3,643

5.06 stars / hour


Automating the Search for Artificial Life with Foundation Models

sakanaai/asal • 23 Dec 2024

With the recent Nobel Prize awarded for radical advances in protein discovery, foundation models (FMs) for exploring large combinatorial spaces promise to revolutionize many scientific fields.

Artificial Life Ingenuity

102

3.49 stars / hour


Arbitrary-steps Image Super-resolution via Diffusion Inversion

zsyoaoa/invsr • 12 Dec 2024

This study presents a new image super-resolution (SR) technique based on diffusion inversion, aiming at harnessing the rich image priors encapsulated in large pre-trained diffusion models to improve SR performance.

Image Super-Resolution

599

2.22 stars / hour


CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

huage001/clear • 20 Dec 2024

Diffusion Transformers (DiT) have become a leading architecture in image generation.

8k, Image Generation +1

108

2.09 stars / hour


Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

answerdotai/modernbert • 18 Dec 2024

Encoder-only transformer models such as BERT offer a great performance-size tradeoff for retrieval and classification tasks with respect to larger decoder-only models.

Decoder, Retrieval

647

2.04 stars / hour


Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

Seed3D/Dora • arXiv 2024

The widely adopted uniform point sampling strategy in Shape VAE training often leads to a significant loss of geometric details, limiting the quality of shape reconstruction and downstream generation tasks.

3D Shape Modeling, Benchmarking

1.70 stars / hour


DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

lihxxx/dispose • 12 Dec 2024

We generate a dense motion field from a sparse motion field and the reference image, which provides region-level dense guidance while maintaining the generalization of the sparse pose control.

Image Animation

200

1.44 stars / hour


Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

vision-x-nyu/thinking-in-space • 18 Dec 2024

Humans possess the visual-spatial intelligence to remember spaces from sequential visual observations.

Question Answering, Spatial Reasoning

199

1.35 stars / hour


Large Concept Models: Language Modeling in a Sentence Representation Space

facebookresearch/large_concept_model • 11 Dec 2024

In this paper, we present an attempt at an architecture which operates on an explicit higher-level semantic representation, which we name a concept.

Language Modelling, Sentence +3

759

1.33 stars / hour


On the Measure of Intelligence

fchollet/ARC • 5 Nov 2019

To make deliberate progress towards more intelligent and more human-like artificial systems, we need to be following an appropriate feedback signal: we need to be able to define and evaluate intelligence in a way that enables comparisons between two systems, as well as comparisons with humans.

ARC, Benchmarking

3,991

0.97 stars / hour
