|
- GitHub - huggingface datasets: The largest hub of ready-to-use . . .
🤗 Datasets is a lightweight library providing two main features: one-line dataloaders for many public datasets: one-liners to download and pre-process any of the major public datasets (image datasets, audio datasets, text datasets in 467 languages and dialects, etc ) provided on the HuggingFace Datasets Hub
- datasets · GitHub Topics · GitHub
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data computer-vision deep-learning geospatial models pytorch remote-sensing satellite-imagery datasets earth-observation transforms torchvision
- Curated open data · GitHub
Relevant open data curated Curated open data has 148 repositories available Follow their code on GitHub
- easy-dataset README. zh-CN. md at main - GitHub
A powerful tool for creating fine-tuning datasets for LLM - ConardLi easy-dataset
- GitHub - luminati-io Free-datasets: A collection of multiple free . . .
This repository contains a collection of free datasets with thousands of records for use in data analysis, machine learning, and research The datasets span multiple domains, from business to social media data All the datasets were collected with our Web Scraper APIs Want custom datasets or large datasets from popular and hard to scrape domains?
- datasets awesome-data: Curated list of quality open datasets - GitHub
The awesome section presents collections of high quality datasets organized by topic Home page for awesome collections is located in the awesome-data repository on github and should be modified from there See the live page here:
- ConardLi easy-dataset - GitHub
Domain Labels: Intelligently builds global domain labels for datasets, with global understanding capabilities; Answer Generation: Uses LLM API to generate comprehensive answers and Chain of Thought (COT) Flexible Editing: Edit questions, answers, and datasets at any stage of the process
- Datasets For Recommender Systems - GitHub
In order to use RecBole, you need to convert these original datasets to the atomic file which is a kind of data format defined by RecBole We provide two ways to convert these datasets into atomic files: Download the raw dataset and process it with conversion tools we provide in this repository Please refer to conversion tools
|
|
|