Audio dataset. It transforms raw audio sources into clean, curated, and captione...

Audio dataset. It transforms raw audio sources into clean, curated, and captioned segments specifically optimized for training the LTX-2 model in audio-only mode. Engineered for production sound classification and audio AI models. Custom Audio Classification Datasets for Sound AI Precisely labeled audio corpora across environmental sound, industrial noise, bioacoustics, and event detection. AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. Audio Segmenter & Captioner Pipeline. Nov 16, 2021 · A curated list of audio datasets for machine learning, contributed by the DagsHub community during the Hacktoberfest 2021. Jul 9, 2025 · About Dataset Overview: This dataset provides detailed metadata and audio analysis for a wide collection of Spotify music tracks across various genres. LTX-2 Audio Dataset Builder is a specialized tool designed to automate the creation of high-quality audio datasets. The dataset is a rich source for LAION-Audio-630K is a large-scale audio-text dataset consisting of 633,526 pairs with the total duration of 4,325. Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Each dataset includes a brief description, file type, language, and number of recordings. Contribute to dorpxam/LTX-2-Audio-Dataset-Builder development by creating an account on GitHub. Nov 25, 2025 · A list of over 150 open audio and video datasets for various machine-learning tasks, such as speech recognition, human pose estimation, and sound event detection. 39 hours. It contains audios of human activities, natural sounds and audio effects, consisting of 8 data sources (see the data source table below) from publicly available websites. We collect these datasets by downloading audios and relevant text descriptions. AudioSet is a dataset of 10-second clips from YouTube, annotated into one or more sound categories, following the AudioSet ontology. We’re on a journey to advance and democratize artificial intelligence through open source and open science. Supported Tasks and Leaderboards Jan 6, 2026 · These resources can, therefore, be useful to researchers and developers to train and improve ML models in various audio-related tasks such as speech recognition, audio generation, and sound classification, among others. The datasets cover various languages, domains, and sources, and can be listened to, downloaded, and versioned on DagsHub. Based on our current . Flexible Data Ingestion. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More. It includes track-level information such as popularity, tempo, energy, danceability, and other musical features that can be used for music recommendation systems, genre classification, or trend analysis. oteotu oytqlygy lplcva nsbqykr qirda buoyn yyo xcbvcuv chyrfqu sdlap