- GitHub - huggingface/datasets: The largest hub of ready-to-use ...
🤗 Datasets is a lightweight library providing two main features: one-line dataloaders for many public datasets: one-liners to download and pre-process any of the major public datasets (image datasets, audio datasets, text datasets in 467 languages and dialects, etc.) provided on the HuggingFace Datasets Hub.
- GitHub - paperswithcode/paperswithcode-data: The full dataset behind ...
All papers with abstracts; links between papers and code; evaluation tables; methods; datasets. The last JSON is in the sota-extractor format, and the code from there can be used to load the JSON into a set of Python classes. At the moment, data is regenerated daily. Part of the data comes from the sources listed in the sota-extractor README.
- datasets · GitHub Topics · GitHub
GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.
- easy-dataset/README.zh-CN.md at main - GitHub
A powerful tool for creating fine-tuning datasets for LLMs - ConardLi/easy-dataset
- Curated open data · GitHub
Pinned repositories: awesome-data (curated list of quality open datasets; 876 stars, 124 forks); covid-19 (Novel Coronavirus 2019 time series data on cases; Python; 1.2k stars, 602 forks); country-codes.
- A collection of datasets originally distributed in R packages
Rdatasets is a collection of 3485 datasets that were originally distributed alongside the statistical software environment R and some of its add-on packages. The goal is to make these data more broadly accessible for teaching and statistical software development.
- GitHub - doormanBreach/FreeDatabreaches: Download Free Databreaches
This repository is a centralized hub for data breaches that have occurred over the years. Whether you are a cybersecurity researcher, data analyst, or simply curious about data breaches, you can access, download, and explore these datasets.
- A bunch of some 200 datasets. You can call it mini-kaggle :)
Topics: tsv, data-science, data, csv, database, ml, datasets, nlp-machine-learning, image-files, mini-kaggle. License: Apache-2.0.