Monolith: Real Time Recommendation System With Collisionless Embedding Table

bytedance/monolith 16 Sep 2022

In this paper, we present Monolith, a system tailored for online training.

3,643
5.06 stars / hour

Automating the Search for Artificial Life with Foundation Models

sakanaai/asal 23 Dec 2024

With the recent Nobel Prize awarded for radical advances in protein discovery, foundation models (FMs) for exploring large combinatorial spaces promise to revolutionize many scientific fields.

Artificial Life Ingenuity

102
3.49 stars / hour

Arbitrary-steps Image Super-resolution via Diffusion Inversion

zsyoaoa/invsr 12 Dec 2024

This study presents a new image super-resolution (SR) technique based on diffusion inversion, aiming at harnessing the rich image priors encapsulated in large pre-trained diffusion models to improve SR performance.

Image Super-Resolution

599
2.22 stars / hour

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

huage001/clear 20 Dec 2024

Diffusion Transformers (DiT) have become a leading architecture in image generation.

8k Image Generation +1

108
2.09 stars / hour

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

answerdotai/modernbert 18 Dec 2024

Encoder-only transformer models such as BERT offer a great performance-size tradeoff for retrieval and classification tasks with respect to larger decoder-only models.

Decoder Retrieval

647
2.04 stars / hour

Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

Seed3D/Dora arXiv 2024

However, the widely adopted uniform point sampling strategy in Shape VAE training often leads to a significant loss of geometric details, limiting the quality of shape reconstruction and downstream generation tasks.

3D Shape Modeling Benchmarking

54
1.70 stars / hour

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

lihxxx/dispose 12 Dec 2024

Specifically, we generate a dense motion field from a sparse motion field and the reference image, which provides region-level dense guidance while maintaining the generalization of the sparse pose control.

Image Animation

200
1.44 stars / hour

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

vision-x-nyu/thinking-in-space 18 Dec 2024

Humans possess the visual-spatial intelligence to remember spaces from sequential visual observations.

Question Answering Spatial Reasoning

199
1.35 stars / hour

Large Concept Models: Language Modeling in a Sentence Representation Space

facebookresearch/large_concept_model 11 Dec 2024

In this paper, we present an attempt at an architecture which operates on an explicit higher-level semantic representation, which we name a concept.

Language Modelling Sentence +3

759
1.33 stars / hour

On the Measure of Intelligence

fchollet/ARC 5 Nov 2019

To make deliberate progress towards more intelligent and more human-like artificial systems, we need to be following an appropriate feedback signal: we need to be able to define and evaluate intelligence in a way that enables comparisons between two systems, as well as comparisons with humans.

ARC Benchmarking

3,991
0.97 stars / hour