WEBVTT 1 00:00:00.070 --> 00:00:03.020 - [Instructor] Info provides a summary of the data frame 2 00:00:03.020 --> 00:00:06.050 including the number of entries, the data type, 3 00:00:06.050 --> 00:00:08.050 and the number of non-null entries 4 00:00:08.050 --> 00:00:10.090 for each series in the data frame. 5 00:00:10.090 --> 00:00:12.080 This is important because often when working 6 00:00:12.080 --> 00:00:16.030 with a real data set, there will be missing data. 7 00:00:16.030 --> 00:00:18.030 You want a view of this to determine 8 00:00:18.030 --> 00:00:20.050 how you will handle this missing data. 9 00:00:20.050 --> 00:00:24.030 Let's head back to the Jupyter notebook. 10 00:00:24.030 --> 00:00:28.030 So I enter the name of the data frame. 11 00:00:28.030 --> 00:00:29.050 We can see that we do not have 12 00:00:29.050 --> 00:00:31.030 any missing data in this data set. 13 00:00:31.030 --> 00:00:34.010 There are 29,216 entries, 14 00:00:34.010 --> 00:00:36.050 and none of them are non-null. 15 00:00:36.050 --> 00:00:39.020 What is also useful about the info method 16 00:00:39.020 --> 00:00:41.000 is that it also gives you the data types 17 00:00:41.000 --> 00:00:43.020 for the different series of columns. 18 00:00:43.020 --> 00:00:44.080 This is useful when you have to do 19 00:00:44.080 --> 00:00:46.080 any comparisons or queries. 20 00:00:46.080 --> 00:00:48.090 So for example, if you are querying what is a string, 21 00:00:48.090 --> 00:00:50.060 then you will know that you have to use quotes, 22 00:00:50.060 --> 00:00:53.000 and if you are using a comparison for an integer 23 00:00:53.000 --> 00:00:56.090 or a float value, then you won't need to use those quotes. 24 00:00:56.090 --> 00:00:59.080 In the next video, we will learn how to use Pandas 25 00:00:59.080 --> 00:00:59.080 to do some basic data analysis.