🗃️

Hugging Face Datasets

available

Create and manage datasets on Hugging Face Hub — initialize repos, define configs, stream row updates, and run SQL-based queries.

ai-ml

What This Skill Does

This skill creates and manages datasets on the Hugging Face Hub: it initializes dataset repositories, defines configs and system prompts, streams row updates, and queries or transforms rows with SQL.

Capabilities

  • Dataset Creation — Initialize and configure dataset repositories
  • Row Management — Stream updates and manage dataset rows
  • SQL Queries — Query and transform datasets using SQL
  • Config Management — Define dataset configs and system prompts
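
To make the SQL Queries capability concrete, here is a minimal local sketch of filtering rows with SQL. It uses Python's built-in sqlite3 as a stand-in engine, which is an assumption: the listing does not say which SQL engine the skill runs. The rows, the `quality` column, and the `filter_rows` helper are hypothetical.

```python
import sqlite3

# Hypothetical rows; in practice these would come from a dataset on the Hub.
ROWS = [
    {"id": 1, "instruction": "Translate 'hello' to French.", "response": "bonjour", "quality": 5},
    {"id": 2, "instruction": "Add 2 and 2.", "response": "5", "quality": 1},
    {"id": 3, "instruction": "Name a prime number.", "response": "7", "quality": 4},
]


def filter_rows(rows, min_quality):
    """Load rows into an in-memory SQLite table and keep those at or above min_quality."""
    conn = sqlite3.connect(":memory:")
    conn.row_factory = sqlite3.Row  # lets us turn result rows into dicts
    conn.execute(
        "CREATE TABLE data (id INTEGER, instruction TEXT, response TEXT, quality INTEGER)"
    )
    conn.executemany(
        "INSERT INTO data VALUES (:id, :instruction, :response, :quality)", rows
    )
    cursor = conn.execute(
        "SELECT id, instruction, response FROM data WHERE quality >= ? ORDER BY id",
        (min_quality,),
    )
    kept = [dict(r) for r in cursor.fetchall()]
    conn.close()
    return kept


kept = filter_rows(ROWS, min_quality=4)
print([r["id"] for r in kept])
```

The threshold is passed as a bound parameter rather than formatted into the query string, which keeps the filter safe to reuse with user-supplied values.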

How It Works

  1. Create or select a dataset repository on the Hub
  2. Define dataset configuration and schema
  3. Add, update, or query rows using the CLI or SQL
  4. Changes are synced to the Hub automatically
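
The steps above can be sketched with the `huggingface_hub` Python client. This is one possible approach under stated assumptions, not the skill's own implementation: the listing does not show its CLI, so the sketch calls `HfApi.create_repo` and `HfApi.upload_file` directly and uploads explicitly. The `push_rows` helper, the `data/train.jsonl` path, and the sample row are hypothetical.

```python
import json


def rows_to_jsonl(rows):
    """Serialize a list of dict rows as JSON Lines (one JSON object per line)."""
    return "".join(json.dumps(row, ensure_ascii=False) + "\n" for row in rows)


def push_rows(repo_id, rows, path_in_repo="data/train.jsonl"):
    """Create the dataset repo if it is missing, then upload rows as one JSONL file.

    Assumes `huggingface_hub` is installed and a write token is configured
    (for example via `huggingface-cli login`).
    """
    # Imported lazily so rows_to_jsonl can be used and tested offline.
    from huggingface_hub import HfApi

    api = HfApi()
    # Step 1: create or select the dataset repository.
    api.create_repo(repo_id=repo_id, repo_type="dataset", exist_ok=True)
    # Steps 3 and 4: write the rows and push them to the Hub in one commit.
    api.upload_file(
        path_or_fileobj=rows_to_jsonl(rows).encode("utf-8"),
        path_in_repo=path_in_repo,
        repo_id=repo_id,
        repo_type="dataset",
        commit_message="Add rows",
    )


# Offline check of the serialization step; no network call is made here.
payload = rows_to_jsonl([{"instruction": "Say hi.", "response": "Hi!"}])
print(payload, end="")
```

A call such as `push_rows("your-username/your-dataset", rows)` would perform the upload; the repository name is a placeholder.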

Example Usage

"Create a new instruction-tuning dataset"
"Query my dataset using SQL to filter entries"
"Stream new rows to an existing dataset"
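
For the first prompt, a row in an instruction-tuning dataset might look like the sketch below. The field names (`config`, `system`, `instruction`, `response`) and the `DatasetConfig` class are illustrative assumptions; the listing does not specify the schema the skill uses.

```python
from dataclasses import dataclass


@dataclass
class DatasetConfig:
    """Hypothetical config: a name plus the system prompt shared by its rows."""
    name: str
    system_prompt: str


def make_row(config, instruction, response):
    """Build one instruction-tuning row, rejecting empty fields."""
    instruction = instruction.strip()
    response = response.strip()
    if not instruction or not response:
        raise ValueError("instruction and response must be non-empty")
    return {
        "config": config.name,
        "system": config.system_prompt,
        "instruction": instruction,
        "response": response,
    }


config = DatasetConfig(name="default", system_prompt="You are a helpful assistant.")
row = make_row(config, "  Summarize: cats sleep a lot. ", "Cats sleep often.")
print(row)
```

Validating rows before they are written keeps empty or whitespace-only examples out of the dataset.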