
Category Specific Datasets
Artificial Intelligence and Machine Learning Data
Regular price
$265.00
Tax included.
Shipping calculated at checkout.
Please note: specifying a category of data may delay processing of your order. For the fastest processing and shipping, please select the "Any Drive" option on the Home Page!
We will send you a new 8TB hard drive at our actual cost containing one or more of the following datasets, and you will receive a guaranteed Filecoin storage deal for one year:
Berkeley Self-Driving Data: Open-source video data from Berkeley's self-driving program containing 100,000 videos representing more than 1,000 hours of driving experience with more than 100 million frames.
Multimedia Commons: Collection of audio and visual features computed for the nearly 100 million Creative Commons-licensed Flickr images and videos in the YFCC100M dataset from Yahoo! Labs, along with ground-truth annotations for selected subsets.
NLP fast.ai: Some of the most important datasets for NLP, with a focus on classification, including IMDb, AG-News, Amazon Reviews (polarity and full), Yelp Reviews (polarity and full), DBpedia, Sogou News (Pinyin), Yahoo Answers, WikiText-2 and WikiText-103, and the ACL-2010 French-English 10^9 corpus.
Mevadata: Video data of human activity, both scripted and unscripted, collected with roughly 100 actors over several weeks. The data was collected with 29 cameras with overlapping and non-overlapping fields of view.
Google Ngrams: N-grams are fixed-size tuples of items. In this case, the items are words extracted from the Google Books corpus. The n specifies the number of elements in the tuple, so a 5-gram contains five words.
FMA: The Free Music Archive (FMA) dataset is an open and easily accessible dataset suitable for evaluating several tasks in music information retrieval (MIR), a field concerned with browsing, searching, and organizing large music collections.
Your drive may also contain other valuable datasets, such as Landsat 8, 1000 Genomes, and ENCODE, to fill the drive's 8TB capacity.
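For readers unfamiliar with the n-gram idea described in the Google Ngrams entry above, here is a minimal Python sketch of how word n-grams are formed. It is an illustration only: the function name word_ngrams, the lowercase whitespace tokenization, and the sample sentences are assumptions made for this example and do not reflect how the Google Books corpus was actually processed or how the files on the drive are laid out.

```python
from collections import Counter


def word_ngrams(text, n):
    """Return every contiguous run of n words in text as a tuple.

    Hypothetical helper for illustration: tokenization is a plain
    lowercase whitespace split, which is an assumption of this sketch.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    words = text.lower().split()
    # Slide a window of width n across the word list.
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


# A 5-gram contains five words; a nine-word sentence yields five of them.
sample = "the quick brown fox jumps over the lazy dog"
five_grams = word_ngrams(sample, 5)

# Counting repeated n-grams is the basic operation behind n-gram frequency tables.
bigram_counts = Counter(word_ngrams("to be or not to be", 2))

if __name__ == "__main__":
    for gram in five_grams:
        print(" ".join(gram))
    print(bigram_counts.most_common(1))
```

A text with fewer than n words simply produces an empty list, so the helper can be applied to short strings without special handling.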
HDD Specs:
- Capacity: 8TB
- Form factor: 3.5"
- Connection type: SATA, 6 Gb/s
- Spindle speed: 7200 RPM
- Buffer size: 256MB
- Weight: 3 lbs