Data Sets

Image Dataset Name Short Description Tags File Size
Grammar and Online Product Reviews A list of 71,045 online reviews from 1,000 different products. 9 MB
Indian Premier League (Cricket) Ball-By-Ball Cricket Data
  • Sports
  • 1 MB
    Honey production In The USA (1998-2012) Honey Production Figures and Prices by National Agricultural Statistics Service
  • Agriculture
  • Neuroscience
  • 24 KB
    Global Commodity Trade Statistics Three decades of global trade flows
  • Data analysis
  • Economics
  • EarthScience
  • 121 MB
    Age Detection of Indian Actors This is a fascinating challenge for any deep learning enthusiast.
  • Data analysis
  • Business
  • Healthcare
  • 48MB
    LibriSpeech This dataset is a large-scale corpus of around 1000 hours of English speech.
  • Data analysis
  • Education
  • 60 GB
    Million Song Dataset The Million Song Dataset is a freely-available collection of audio features and metadata for a million contemporary popular music tracks. 280 GB
    Free Music Archive (FMA) FMA is a dataset for music analysis.
  • Data analysis
  • NaturalLanguage
  • 100 GB
    Free Spoken Digit Dataset Another entry in this list for inspired by the MNIST dataset!
  • Education
  • 10 MB
    Machine Translation of Various Languages This dataset consists of training data for four European languages.
  • Data analysis
  • NaturalLanguage
  • 15 GB
    Machine Translation of Various Languages This dataset consists of training data for four European languages.
  • Data analysis
  • NaturalLanguage
  • 15 GB
    The Blog Authorship Corpus This dataset consists of blog posts collected from thousands of bloggers and has been gathered from blogger.com.
  • Data analysis
  • NaturalLanguage
  • 300 MB
    The Wikipedia Corpus This dataset is a collection of a the full text on Wikipedia.
  • Data analysis
  • Business
  • DataChallenges
  • NaturalLanguage
  • 20 MB
    Sentiment140 Sentiment140 is a dataset that can be used for sentiment analysis.
  • Data analysis
  • NaturalLanguage
  • PublicDomains
  • 80 MB
    Fashion-MNIST Fashion-MNIST consists of 60,000 training images and 10,000 test images.
  • Data analysis
  • 30 MB
    CIFAR-10 This dataset is another one for image classification.
  • Data analysis
  • 170 MB
    MNIST MNIST is one of the most popular deep learning datasets out there.
  • Data analysis
  • ImageProcessing
  • 50MB
    Twenty Newsgroups This dataset, as the name suggests, contains information about newsgroups. 20 MB
    IMDB Reviews This is a dream dataset for movie lovers. It is meant for binary sentiment classification and has far more data than any previous datasets in this field. Apart from the training and test review examples, there is further unlabeled data for use as well. Raw text and preprocessed bag of words formats have also been included.
  • Data analysis
  • ComputerNetworks
  • NaturalLanguage
  • SocialNetworks
  • 80 MB
    The Street View House Numbers This is a real-world image dataset for developing object detection algorithms. This requires minimum data preprocessing. It is similar to the MNIST dataset mentioned in this list, but has more labelled data (over 600,000 images). The data has been collected from house numbers viewed in Google Street View.
  • Data analysis
  • Economics
  • ImageProcessing
  • 2.5 GB

    © copyright 2017 www.aimlmarketplace.com. All Rights Reserved.

    A Product of HunterTech Ventures