In this article I’m going to scrape the data from wikipedia site and analyze it in Just 1 minute. Start the stopwatch:
Data for Analysis is Padma Vibhushan Awards, which is one of the highest civilian award given in the following field:
Literature & Education, Arts, Public Affairs, Trade & Industry, Social Work, Civil Service, Medicine, Science & Engineering, Sport
Wikipedia URL for the data:
Now to import the data from this site to Google spreadsheet, We will use a simple one line formula to Imports data from a table or list within an HTML page.
Check other functions also:
IMPORTFEED: Imports a RSS or ATOM feed.IMPORTRANGE: Imports a range of cells from a specified spreadsheet.
IMPORTXML: imports data from any of various structured data types including XML, HTML, CSV, TSV, and RSS and ATOM XML feeds.
To scrape the data from the above URL, our above values for IMPORTHTML would be as follows:
Select cell A1 and add the formula as shown in the figure below and press Enter Key and Magic!! the data is downloaded in the spreadsheet
Let’s find out the Awards given by Year from 1954-2017:
Select Column A in the sheet and Select Insert > Chart > Column Chart
And your Beautiful Data Visualization is ready!!
Rest of the visualization and Dashboard I will explain in my next blog “Dashboarding with Google Spreadsheet”
Stop the watch and check the time. Hope it’s far less than a minute.