Trending Research

Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU

NVIDIA/cutlass • 9 Jan 2023

We introduce Stream-K, a work-centric parallelization of matrix multiplication (GEMM) and related computations in dense linear algebra.

Data Structures and Algorithms Distributed, Parallel, and Cluster Computing

4,187

0.11 stars / hour

Paper
Code

Stein Variational Guided Model Predictive Path Integral Control: Proposal and Experiments with Fast Maneuvering Vehicles

kohonda/proj-svg_mppi • 20 Sep 2023

While MPPI can find a Gaussian-approximated optimal action distribution in closed form, i. e., without iterative solution updates, it struggles with the multimodality of the optimal distributions.

Robotics Information Theory Information Theory

0.10 stars / hour

Paper
Code

SimLOD: Simultaneous LOD Generation and Rendering

m-schuetz/simlod • 5 Oct 2023

Background: LOD construction is typically implemented as a preprocessing step that requires users to wait before they are able to view the results in real time.

Graphics

392

0.09 stars / hour

Paper
Code

MediaPipe: A Framework for Building Perception Pipelines

google/mediapipe • • 14 Jun 2019

A developer can use MediaPipe to build prototypes by combining existing perception components, to advance them to polished cross-platform applications and measure system performance and resource consumption on target platforms.

Distributed, Parallel, and Cluster Computing

24,782

0.08 stars / hour

Paper
Code

A Privacy-Preserving Healthcare Framework Using Hyperledger Fabric

hyperledger/fabric • 18 Nov 2020

Electronic health record (EHR) management systems require the adoption of effective technologies when health information is being exchanged.

Cryptography and Security

15,244

0.08 stars / hour

Paper
Code

AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario

pyannote/pyannote-audio • • 8 Apr 2021

This allows the researchers to explore different aspects in meeting processing, ranging from individual tasks such as speech front-end processing, speech recognition and speaker diarization, to multi-modality modeling and joint optimization of relevant tasks.

Sound Audio and Speech Processing

4,586

0.07 stars / hour

Paper
Code

SubPipe: A Submarine Pipeline Inspection Dataset for Segmentation and Visual-inertial Localization

remaro-network/subpipe-dataset • • 31 Jan 2024

This paper presents SubPipe, an underwater dataset for SLAM, object detection, and image segmentation.

Robotics

0.07 stars / hour

Paper
Code

Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion

open-mmlab/Amphion • • 17 Oct 2023

It is yet to be explored what characteristics of content features from different acoustic models are, and whether integrating multiple content features can help each other.

Sound Audio and Speech Processing

3,460

0.06 stars / hour

Paper
Code

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

open-mmlab/amphion • • 15 Dec 2023

Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Sound Audio and Speech Processing

3,460

0.06 stars / hour

Paper
Code

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

coqui-ai/TTS • • 11 Jun 2021

Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems.

Sound Audio and Speech Processing

26,832

0.06 stars / hour

Paper
Code