Now that we have learned a bit about data manipulation and retrieval, we can take next step. That is to start analyzing the data we have gathered. And the easiest way for data analysis is through graph plotting. Don't worry you won't have to calculate every x and y coordinate to plot on graph because python or more specifically matplotlib will do it for us. Q.What is Matplotlib? Ans. It is a python package that makes graph plotting very easy. Matplotlib is a plotting library for the Python programming language and its numerical mathematics extension NumPy. Wikipedia Later on we will build on to the graph plotting using another python library Seaborn, it is built upon Matplotlib. At this stage learning these libraries might feel very boring to you but trust me if you are to become a data scientist some day, these will be your bread and butter. So focus and absorb as much as you can. The best resource you need to learn matplotlib is the course that i have mentioned in
The name sounds funny but Pandas is one of the most useful python library for data science. Most of the time spend by data scientists is used to clean and manipulate the data. Pandas provide the useful data structures such as Series and Data Frame . These are easy to manipulate and the and with them possibilities are limitless. Installing Pandas by PIP: Just type this in your terminal/shell Pip install pandas Kudos. Now you have pandas installed. Next -> Start exploring pandas Resource : Pandas for beginners Next thing we are going to use is Jupyter Notebook. The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and much more Jupyter.org This link will guide how to use Jupyter Notebooks. after you are done with that go