1. Create Account

Go to:

Hugging Face

Create an account.


2. Install Git LFS

On Ubuntu/Debian:

sudo apt update
sudo apt install git-lfs

Initialize:

git lfs install

3. Create a Dataset Repository

Open:

New Dataset Repo


4. Generate Access Token

Open:

Access Tokens

Create token with:

  • Write permission

Copy the token. This token can be used when push it git repo


5. Clone Dataset Repo

Replace USERNAME and DATASET:

git clone https://huggingface.co/datasets/USERNAME/DATASET

Example:

git clone https://huggingface.co/datasets/suhailvs/kerala-html-dataset

6. Commit and Push

cd kerala-html-dataset

git add .
git commit -m "Add batch 001"
git push

Large files are automatically handled by Git LFS.

7. optional login from terminal

pip install huggingface_hub
huggingface-cli login

<
Previous Post
Summary of the Kerala Budget 2025–26
>
Blog Archive
Archive of all previous blog posts