You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

5.9 KiB

None <html> <head> </head>

Importing data

In [2]:
from pandas import read_csv
url = "https://raw.githubusercontent.com/jbrownlee/Datasets/master/iris.csv"
names = ['sepal-length', 'sepal-width', 'petal-length', 'petal-width', 'class']
dataset = read_csv(url, names=names)
dataset.head(10)
Out[2]:
sepal-length sepal-width petal-length petal-width class
0 5.1 3.5 1.4 0.2 Iris-setosa
1 4.9 3.0 1.4 0.2 Iris-setosa
2 4.7 3.2 1.3 0.2 Iris-setosa
3 4.6 3.1 1.5 0.2 Iris-setosa
4 5.0 3.6 1.4 0.2 Iris-setosa
5 5.4 3.9 1.7 0.4 Iris-setosa
6 4.6 3.4 1.4 0.3 Iris-setosa
7 5.0 3.4 1.5 0.2 Iris-setosa
8 4.4 2.9 1.4 0.2 Iris-setosa
9 4.9 3.1 1.5 0.1 Iris-setosa

This is the part of code you have to solve and deliver as a part of your report.

Statistical Summary

For this section you can create your own functions or also you can writte down the code as chunks to compute:

  • Mean
  • Number of elements per properti
  • Standard deviation

Plots and data visualization

Make the next plots for data exploration and visulization:

  • Box and Whiskers plot
  • histogram plot
  • scatter plot for sepal length and width
  • scatter plot for petal length and width
</html>