GitHub - huggingface datasets: The largest hub of ready-to-use . . . 🤗 Datasets is a lightweight library providing two main features: one-line dataloaders for many public datasets: one-liners to download and pre-process any of the major public datasets (image datasets, audio datasets, text datasets in 467 languages and dialects, etc ) provided on the HuggingFace Datasets Hub
datasets · GitHub Topics · GitHub GitHub is where people build software More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects
Curated open data · GitHub Relevant open data curated Curated open data has 152 repositories available Follow their code on GitHub
GitHub - ncbi datasets: NCBI Datasets is a new resource that lets you . . . NCBI Datasets data reports NCBI Datasets data packages include data report files that contain metadata about the requested records Data report schemas describe each type of data report, including available fields, with descriptions and examples
Awesome UK Government Datasets - GitHub Awesome UK Government Datasets A curated list of awesome publicly available datasets published by or relevant to the UK government with the goal of making these resources more accessible to the broader data community This repo is inspired by awesome-public-datasets and maintained by i AI
A collection of datasets originally distributed in R packages Rdatasets is a collection of 3499 datasets which were originally distributed alongside the statistical software environment R and some of its add-on packages The goal is to make these data more broadly accessible for teaching and statistical software development
TensorFlow Datasets - GitHub TFDS is a collection of datasets ready to use with TensorFlow, Jax, - tensorflow datasets