Interesting Datasets A collection of datasets that I come across LDJnr/Capybara Viewer • Updated Jun 7, 2024 • 16k • 3.41k • 247 teknium/openhermes Viewer • Updated Sep 7, 2023 • 243k • 979 • 219 VMware/open-instruct Viewer • Updated Jul 12, 2023 • 143k • 155 • 44 euclaise/WritingPrompts_preferences Viewer • Updated Dec 25, 2023 • 265k • 68 • 10
Requires Filtering Datasets that HAS TO BE FILTERED. Squish42/bluemoon-fandom-1-1-rp-cleaned Updated Jul 9, 2023 • 164 • 78
Augmentable A collection of datasets that should be augmented further with gpt-4 allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 354k • 314 codeparrot/apps Updated Oct 20, 2022 • 18.4k • 201 facebook/belebele Viewer • Updated Aug 12, 2024 • 110k • 10.2k • 122 google/boolq Viewer • Updated Jan 22, 2024 • 12.7k • 36.3k • 99
Interesting Datasets A collection of datasets that I come across LDJnr/Capybara Viewer • Updated Jun 7, 2024 • 16k • 3.41k • 247 teknium/openhermes Viewer • Updated Sep 7, 2023 • 243k • 979 • 219 VMware/open-instruct Viewer • Updated Jul 12, 2023 • 143k • 155 • 44 euclaise/WritingPrompts_preferences Viewer • Updated Dec 25, 2023 • 265k • 68 • 10
Augmentable A collection of datasets that should be augmented further with gpt-4 allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 354k • 314 codeparrot/apps Updated Oct 20, 2022 • 18.4k • 201 facebook/belebele Viewer • Updated Aug 12, 2024 • 110k • 10.2k • 122 google/boolq Viewer • Updated Jan 22, 2024 • 12.7k • 36.3k • 99
Requires Filtering Datasets that HAS TO BE FILTERED. Squish42/bluemoon-fandom-1-1-rp-cleaned Updated Jul 9, 2023 • 164 • 78