How to do Data Profiling and display Summary Statistics of a Table in Power BI | Power BI Blog


How to generate Summary Statistics of a Table Data in Power BI

Suppose we have a Table “tbl_OrderDetails” as follows with lot of records.

Now we can do data profiling for this table using Power Query> View options as follows.

1) Column Distribution :

Using the “Colum Distribution” option from Power Query > View options, we can see how the data is being distributed across each Column as follows.

It will display the no.of Distinct and Unique values across each column. The Unique values are part of Distinct but they represent those have only one row/record across that dataset/table.

For a Primary key (Eg: Order_Id ) column, the Distinct and Unique count is same as it has one unique id for each record. 

2) Column Quality :

Using “Colum Quality” option from Power Query > View options, we can see the percentage of Valid, Error, and Empty values across each Column as follows.

3) Column Profile :

With “Colum Profile” option from Power Query > View options, we can see the Data Quality, Summary Statistics, and Data Distribution per each Distinct item of a selected Column only.

For the Column Profile example, I have selected the column “Units Sold” from the “tbl_Sales” table as shown below:

The Column Profile displays the Summary Statistics of Column like Count, Error, Empty, Distinct, Unique, NaN(not a number), Zero, Min, Max, Average, Standard Deviation, Even and Odd values in the Column.

However, if we wants to generate a Summary Statistics table for all the Columns of a Table then we can generate it using the following MQuery.

= Table.Profile(tbl_OrderDetails)

This query will generate the Summary Statistics of the table “tbl_OrderDetails” as follows.

——————————————————————————————————–

Thanks, TAMATAM ; Business Intelligence & Analytics Professional

——————————————————————————————————–



Source link

Be the first to comment

Leave a Reply

Your email address will not be published.


*