Skip to Main Content

Machine Learning

Repositories

Python Datasets

Scikit-learn (sklearn.datasets)

  • iris
  • digits
  • wine
  • breast_cancer
  • olivetti_faces
  • 20newsgroups
  • lfw_people
  • covtype
  • rcv1
  • kddcup99
  • california_housing

TensorFlow Datasets (tensorflow_datasets)

  • mnist
  • fashion_mnist
  • cifar10
  • cifar100
  • imagenet2012
  • imdb_reviews
  • squad
  • wikipedia
  • glue
  • sst2

TorchVision (torchvision.datasets) (For PyTorch)

  • MNIST
  • FashionMNIST
  • CIFAR10
  • CIFAR100
  • ImageNet
  • COCO
  • VOC
  • LSUN
  • KMNIST
  • QMNIST
  • USPS
  • CelebA
  • SVHN
  • Omniglot
  • STL10
  • Cityscapes

R Datasets

Base R (datasets package)

  • iris
  • mtcars
  • airquality
  • PlantGrowth
  • ToothGrowth
  • CO2
  • rock
  • pressure
  • ChickWeight
  • attitude
  • sleep
  • warpbreaks
  • faithful
  • quakes
  • trees
  • USArrests

caret Package (Classification and Regression Training)

  • GermanCredit
  • Sacramento
  • BloodBrain
  • twoClassSim

mlbench Package (Machine Learning Benchmarks)

  • BostonHousing
  • Sonar
  • Ionosphere
  • PimaIndiansDiabetes
  • Glass
  • HouseVotes84
  • BreastCancer

ggplot2 (via ggplot2::mpg)

  • mpg
  • diamonds
  • economics
  • midwest
  • randomForest Package
  • ozone (Ozone pollution dataset)
     
©2018 Morgan State University | 1700 East Cold Spring Lane Baltimore, Maryland 21251 | 443-885-3333 | Privacy | Accessibility