I am still astounded by the number of people I meet that have never heard of Power Query and Power Pivot. The good news is there are lots of people out there in for a pleasant surprise. I think Power BI will solve the “lack of awareness of Power Query” problem over the coming months and years. Power BI is much more visible as can been seen in this Google Trends chart that I first created in Dec 2014.
I have written a blog titled “What is Power Pivot” before and I refer people to that page all the time – this helps me quickly give people a detailed summary of what they need to know. Today I am writing about Power Query for the same reason – to have something that I can point people towards when they are just starting out.
So What is Power Query (aka Get and Transform)?
Power Query is an Extract, Transform and Load tool (ETL) that helps Excel and Power BI users connect to data, transform the shape of the data as required and then load the data for further use, typically into Excel, Power BI, CSV etc. ETL tools are not new – there are many enterprise strength tools in the marketplace that are used by IT professionals. What makes Power Query such a ground-breaking tool is it is the first tool built specifically for business users.
Who can benefit from Power Query?
If one or more of the following scenarios applies to you, then Power Query is a great tool to make your life better.
- You spend hours and hours of repetitive work combining data from multiple files (CSV, Excel) into a single Excel workbook before you can start the real work.
- You have data in a file but it is the wrong shape for the task at hand. You need to spend time “reshaping” the data before you can use it.
- You write lots of VLOOKUP formulas over multiple tables of data to “join” the data together.
- You manually have to “cleanse” your data every month removing data and errors that you already know about but you can’t permanently solve back at the source.
- You need to get data from web pages into Excel so you can analyse the data. You cut and paste the data and spend hours reformatting it to remove the noise from the data.
What Can Power Query Do?
Power Query can be used for the following (and probably many more things too).
- A data loading tool
- A data cleansing tool (remove unwanted columns, rows, characters etc)
- A data reshaping tool including a very powerful “unpivot data” function (example shown below)
- A tool to combined data from multiple worksheets or workbooks into a single file/table
- A tool to join data from different sources into 1 coherent data set or table.
- An auditing tool – compare lists of data and find the similarities and differences.
- A website data scraping tool – easily load tabular data from a web page into Excel or Power BI.
- A data creation tool (eg it can create a calendar table for Power Pivot). Read my blog posts about how create a regular Calendar table here, or a 445 Calendar table here.
What Makes Power Query So Great?
Power Query is great for many reasons including:
- It has a simple User Interface that any Excel user can learn. Note the simple menu driven features in the image below.
- The user interface is built on top of a very powerful formula language that has immense capabilities well beyond what most users will ever need. If you do ever need something more complex than the UI can support, the language is there (as shown above) and can be used to build a working solution without switching tools.
- Power Query records repeatable “Process Steps” that can be reused over and over without destroying the underlying data. You can go back through the Process Steps and see the data before the transformation took place. You can even insert new transformation steps earlier in the process flow if needed.
- Repeatable process steps means that you can build the steps for one set of data and then you can “reapply” those same process steps to a different set of data (provided both sets of data are fundamentally the same “shape”). This is great for updating weekly or monthly data (for example).
Versions of Power Query
Power Query was first released as a free Addin for Excel 2010 and Excel 2013. After you install the Addin, you get a new menu on the Excel Ribbon shown below.
You can find the download details for Excel 2010/2013 on my downloads page here.
In Excel 2016 the product was fully integrated into Excel and renamed “Get and Transform” – unfortunately. To find it in Excel 2016, you need to go to the Data tab (#1 below) and then you will see the Get and Transform group (#2 below).
Power BI Desktop
Power Query is also fully integrated into Power BI Desktop where it is called “Get Data” – unfortunately. You can find it by navigating to the Home Tab (#1 below) and then clicking Get Data (#2 below). Power Query (Get Data) is the only way to load data in Power BI Desktop, so if you have used this tool to build a new workbook then you have already used Power Query (Get Data).
Why Microsoft couldn’t stick with the name “Power Query” is anyone’s guess. Maybe they thought users were “scared off” by the “Power” term – who knows.
The Power Query Formula Language (PQL)
The official name of the underlying language of Power Query is the “Power Query Formula Language”. This language was developed from another language called “M”. Both these names are “unfortunate” as the former is a mouth full and the later is very difficult to search for in Google. For these reasons, I have come up with my own name (which I hope will catch on) called Power Query Language – PQL (pronounced Peequel – as in SQL Seequel). PQL is a much better name, easy to search on Google and has a good fit to is cousin SQL.
Where to learn more
I often blog about how to get value from Power Query. You can subscribe to my blog via email at the top of this page. You can also explore the previous Power Query blog posts that have have written here.
There is a great book by Ken Puls and Miguel Escobar – I wrote a book review here. I am also a big fan of Ken’s blog that you can find here.
March 29, 2016 7:00 am