Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
wei's picture
2 14

wei

zhuww
Gigako's profile picture
·

AI & ML interests

None yet

Organizations

None yet

liked a Space 9 months ago
Running
134

TxT360: Trillion Extracted Text

📖
134

Explore the TxT360 LLM pre‑training dataset online

liked 2 datasets 9 months ago

HuggingFaceFW/finepdfs

Viewer • Updated Apr 3 • 476M • 63.9k • 876

m-a-p/FineFineWeb

Viewer • Updated Dec 19, 2024 • 4.89B • 647k • 146
liked a dataset 11 months ago

argilla/ifeval-like-data

Viewer • Updated Oct 17, 2024 • 606k • 2.1k • 49
liked 5 datasets over 1 year ago

wangrongsheng/HealthCareMagic-100k-en

Viewer • Updated May 7, 2023 • 112k • 196 • 20

shibing624/sharegpt_gpt4

Viewer • Updated Feb 23, 2024 • 103k • 1.09k • 138

Shitao/bge-m3-data

Viewer • Updated Apr 26, 2024 • 172k • 180 • 54

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 506k • 1.13k

m-a-p/COIG-CQIA

Viewer • Updated Apr 18, 2024 • 44.7k • 21.2k • 738
liked 5 datasets almost 2 years ago

Mutonix/RefGPT-Fact-v2

Viewer • Updated Mar 13, 2024 • 119k • 116 • 18

philschmid/sharegpt-raw

Preview • Updated Apr 4, 2023 • 168 • 91

aharley/rvl_cdip

Updated Sep 10, 2024 • 1.48k • 81

HuggingFaceM4/pascal_voc

Updated Sep 23, 2022 • 124 • 1

shibing624/medical

Updated Oct 12, 2024 • 2.32k • 432
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs