- Usually, when we're faced with a large amount of data,
one of the first steps is to compute summary statistics
for the data under consideration.
NumPy helps us do that with a set of functions
for calculating aggregates of NumPy arrays,
which take an array as input
and, by default, return a scalar as output.
These include statistics such as the average and standard deviation,
as well as functions for calculating the sum
and the product of the elements in an array.
Let's jump into our first example and see how we can compute
the sum of all values in an array.
There is a simple function to achieve this called sum.
First, let's import numpy as np
and create a one-dimensional array called first_arr.
It will contain the multiples of ten from 10 to 100.
And then two two-dimensional arrays: second_arr,
which has dimensions 3 by 3,
and third_arr, which has dimensions 2 by 5.
Let's calculate the sum for all three arrays.
We just need to type first_arr.sum()
to calculate the sum of all values in first_arr,
and similarly for the second and third arrays.
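Here is a minimal sketch of that setup and the three sums. The names first_arr, second_arr and third_arr are the assumed spellings of the arrays described above. Only the shapes of the second and third arrays are given, so their values here are made up for illustration.

```python
import numpy as np

# Multiples of ten from 10 to 100, as described in the lesson.
first_arr = np.arange(10, 101, 10)

# Assumed values: the lesson only states the shapes (3 by 3 and 2 by 5).
second_arr = np.array([[1, 2, 3],
                       [4, 5, 6],
                       [7, 8, 9]])
third_arr = np.array([[1, 2, 3, 4, 5],
                      [6, 7, 8, 9, 10]])

# With no arguments, sum() adds up every element and returns a scalar.
first_total = first_arr.sum()    # 550
second_total = second_arr.sum()  # 45
third_total = third_arr.sum()    # 55

print(first_total, second_total, third_total)
```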
What if you want to calculate the sum of each column
in the second array, second_arr?
Then we have to pass the axis as an argument,
in this case, axis equals 0.
We can do this by typing second_arr.sum(axis=0).
For the sum of each row,
we will type second_arr.sum(axis=1).
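As a small sketch of the axis argument, using a 3 by 3 array with assumed values (the lesson does not give the actual contents of the second array):

```python
import numpy as np

# Assumed values for the 3 by 3 array; only its shape comes from the lesson.
second_arr = np.array([[1, 2, 3],
                       [4, 5, 6],
                       [7, 8, 9]])

# axis=0 collapses the rows, leaving one sum per column.
column_sums = second_arr.sum(axis=0)  # [12 15 18]

# axis=1 collapses the columns, leaving one sum per row.
row_sums = second_arr.sum(axis=1)     # [ 6 15 24]

print(column_sums)
print(row_sums)
```

Note that once an axis is passed, the result is an array rather than a scalar.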
Next, let's see a function called prod.
The prod function finds the product of all elements in an array.
We will calculate the product for all three arrays.
Just as for sum,
we can calculate the product for each column only.
Let's try it out on third_arr by typing
third_arr.prod(axis=0).
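A sketch of prod on the three arrays, again with assumed values for the two-dimensional ones. The product of the first array is large, so this example passes dtype=np.int64 as a precaution against overflow on platforms where the default integer is 32 bits; that argument is an addition here, not something the lesson uses.

```python
import numpy as np

first_arr = np.arange(10, 101, 10)

# Assumed values; only the shapes (3 by 3 and 2 by 5) come from the lesson.
second_arr = np.array([[1, 2, 3],
                       [4, 5, 6],
                       [7, 8, 9]])
third_arr = np.array([[1, 2, 3, 4, 5],
                      [6, 7, 8, 9, 10]])

# Product of every element in each array.
first_product = first_arr.prod(dtype=np.int64)  # 36288000000000000
second_product = second_arr.prod()              # 362880
third_product = third_arr.prod()                # 3628800

# axis=0 gives one product per column of the 2 by 5 array.
third_column_products = third_arr.prod(axis=0)  # [ 6 14 24 36 50]

print(first_product, second_product, third_product)
print(third_column_products)
```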
Now, we'll use another statistics function
to find the average.
The average function
will return the average of a given array.
If you pass an axis, it will return the average of each column
or of each row.
Let's calculate the average for the three arrays
by typing np.average(first_arr) for the first array,
and similarly for the second and third arrays.
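A short sketch of np.average, with and without an axis. The values of the two-dimensional arrays are assumed, as the lesson only gives their shapes.

```python
import numpy as np

first_arr = np.arange(10, 101, 10)

# Assumed values; only the shapes come from the lesson.
second_arr = np.array([[1, 2, 3],
                       [4, 5, 6],
                       [7, 8, 9]])
third_arr = np.array([[1, 2, 3, 4, 5],
                      [6, 7, 8, 9, 10]])

# Without an axis, the average is taken over all elements.
first_avg = np.average(first_arr)    # 55.0
second_avg = np.average(second_arr)  # 5.0
third_avg = np.average(third_arr)    # 5.5

# With an axis, we get one average per column (axis=0) or per row (axis=1).
second_column_avg = np.average(second_arr, axis=0)  # [4. 5. 6.]
second_row_avg = np.average(second_arr, axis=1)     # [2. 5. 8.]

print(first_avg, second_avg, third_avg)
print(second_column_avg, second_row_avg)
```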
Two extremely useful statistics functions
are min and max, which find the minimum value
and maximum value of a given array.
Let's find the min and max for our first array.
We'll just type np.min(first_arr)
and np.max(first_arr).
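For the first array, that might look like the sketch below, assuming first_arr holds the multiples of ten from 10 to 100 as described earlier:

```python
import numpy as np

first_arr = np.arange(10, 101, 10)

smallest = np.min(first_arr)  # 10
largest = np.max(first_arr)   # 100

print(smallest, largest)
```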
Our last step is to learn the functions for calculating the mean
and standard deviation of a given input array.
To get the mean of our first array,
type np.mean(first_arr).
And for the standard deviation, type np.std(first_arr).
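A minimal sketch of both calls on the first array. One detail worth knowing: by default np.std computes the population standard deviation (ddof=0); the ddof=1 line is an extra shown here only for comparison and is not part of the lesson.

```python
import numpy as np

first_arr = np.arange(10, 101, 10)

mean_value = np.mean(first_arr)  # 55.0

# Population standard deviation (default ddof=0), roughly 28.72.
std_value = np.std(first_arr)

# Sample standard deviation, for comparison only.
sample_std_value = np.std(first_arr, ddof=1)

print(mean_value, std_value, sample_std_value)
```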
NumPy provides many other aggregate functions
that we won't cover here,
but you can easily find them
in the NumPy documentation.