Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU

NVIDIA/cutlass 9 Jan 2023

We introduce Stream-K, a work-centric parallelization of matrix multiplication (GEMM) and related computations in dense linear algebra.

Data Structures and Algorithms; Distributed, Parallel, and Cluster Computing

4,187
0.11 stars / hour

Stein Variational Guided Model Predictive Path Integral Control: Proposal and Experiments with Fast Maneuvering Vehicles

kohonda/proj-svg_mppi 20 Sep 2023

While MPPI can find a Gaussian-approximated optimal action distribution in closed form, i.e., without iterative solution updates, it struggles with the multimodality of the optimal distributions.

Robotics; Information Theory

22
0.10 stars / hour

SimLOD: Simultaneous LOD Generation and Rendering

m-schuetz/simlod 5 Oct 2023

Background: LOD construction is typically implemented as a preprocessing step that requires users to wait before they are able to view the results in real time.

Graphics

392
0.09 stars / hour

MediaPipe: A Framework for Building Perception Pipelines

google/mediapipe 14 Jun 2019

A developer can use MediaPipe to build prototypes by combining existing perception components, to advance them to polished cross-platform applications and measure system performance and resource consumption on target platforms.

Distributed, Parallel, and Cluster Computing

24,782
0.08 stars / hour

A Privacy-Preserving Healthcare Framework Using Hyperledger Fabric

hyperledger/fabric 18 Nov 2020

Electronic health record (EHR) management systems require the adoption of effective technologies when health information is being exchanged.

Cryptography and Security

15,244
0.08 stars / hour

AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario

pyannote/pyannote-audio 8 Apr 2021

This allows the researchers to explore different aspects in meeting processing, ranging from individual tasks such as speech front-end processing, speech recognition and speaker diarization, to multi-modality modeling and joint optimization of relevant tasks.

Sound; Audio and Speech Processing

4,586
0.07 stars / hour

SubPipe: A Submarine Pipeline Inspection Dataset for Segmentation and Visual-inertial Localization

remaro-network/subpipe-dataset 31 Jan 2024

This paper presents SubPipe, an underwater dataset for SLAM, object detection, and image segmentation.

Robotics

14
0.07 stars / hour

Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion

open-mmlab/Amphion 17 Oct 2023

It is yet to be explored what characteristics of content features from different acoustic models are, and whether integrating multiple content features can help each other.

Sound; Audio and Speech Processing

3,460
0.06 stars / hour

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

open-mmlab/amphion 15 Dec 2023

Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Sound; Audio and Speech Processing

3,460
0.06 stars / hour

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

coqui-ai/TTS 11 Jun 2021

Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems.

Sound; Audio and Speech Processing

26,832
0.06 stars / hour