Power BI Introduction to Dataflows Public Preview Feature Date: January 15, 2019 Joe Abbott, Consultant About Joe Abbott Consultant at Pragmatic Works Power BI, Azure, Logic Apps, ADFv2 10+ years working in the Actuarial field
Speak at user groups, webinars, SQL Sat. Grew up in Green Bay, WI Joe Abbott Consultant [email protected] @GrumpyDataGuy Agenda What is a Power BI Dataflow? Why would I use it? Dataflow vs. Dataset
How do I create one? How do I use it? Pro vs. Premium Comparison Security Considerations Bring your own Azure Data Lake Power BI Dataflows What is Power BI? What is a Power BI Dataflow? Why would I use one? What is Power BI?
Power BI is a business analytics solution from Microsoft that lets you visualize your data and share insights across your organization, or embed them in your app or website. Connect to hundreds of data sources and bring your data to life with live dashboards and reports. 1. Power BI Desktop 2. Power BI Service 3.
Power BI Mobile What is a Power BI Many different definitions Dataflow? Self-service data prep Self-service ETL (Extract-Transform-Load) A collection of entities Entities are similar to tables Common misconception:
A Dataflow is NOT a Dataset *Not available for Power BI Report Server (On Prem) What is a Power BI Dataflow? What is a Common Data Model? The Common Data Model (CDM) is a set of a standardized data schemas and a metadata system to allow consistency of data and
its meaning across applications and business processes. Why would I use a Power Easily cleanse, transform, integrate, enrich, and schematize data. BI ingest, Dataflow? Traditional ETL (SSIS) Power BI Dataflows
Why would I use a Power power of M (Power Query) comes to the Power BI Service! BITheDataflow? Power BI Desktop Dataset Power BI Service Power BI Service Dataset
Why would I use a Power Five (5) Reasons to use Power BI Dataflows (or at least try them) BI Dataflow? 1.Intuitive and familiar authoring Power Query (Excel and Power BI users) 2. Auto Orchestration Automatically handles full orchestration of transformations 3. Big Data ready 4. Common Data Model
5. Native integration Sits side-by-side with Datasets, Reports and Dashboards Added benefits Power BI Service Datasets cannot offer Power BI Dataflows How do I create one? How do I use it? How do I create a Power Step BI1 Dataflow? Confirm Tenant Settings are Enabled (https://app.powerbi.com) Requires Admin access (Power BI Administrator role)
Step 2 Navigate to desired App Workspace Create > Dataflow > Add new entities Dataflows are NOT available in My Workspace Step 3 Choose data source > Enter Credentials > Select data/tables/etc. Dem o How do I create a Power BI Dataflow?
How do I refresh the data? Similar to Power BI Datasets Manual Refresh, Scheduled Refresh This will depend on the permissions of the member and App Workspace Unique to Dataflows External referenced data changed Dataflows can refer to data that is not owned by the workspace The Dataflow will proactively look for changes in the data When changes are detected, a recalculation is triggered automatically How do I use a Power BI
Step 1 Dataflow? Open Power BI Desktop Step 2 Get Data > Power BI dataflows (beta) Connector is in Preview read/acknowledge message Step 3 Select Dataflow > Choose Load or Edit Dem
o How do I use a Power BI Dataflow? Pro vs. Premium Comparison Dataflow Capability Power BI Pro Power BI Premium
8 per day 10 GB/User 48 100 TB/node Standardized Schema / Built-In Support for the Common Data Model
Dataflow Management within Power BI, Connector in the Power BI Desktop, Integration with Azure
Scheduled Refresh Total Storage Dataflow Authoring with Power Query Online Computed Entities (in-storage transformations via M) Dataflow incremental refresh Running on Power BI Premium capacity / Parallel execution of transforms Dataflow linked entities Power BI Dataflows
Security All data processed by dataflows is secured in Azure data lake Azure Data Lake Store Gen2 uses standard Access Control Lists (ACLs) to restrict data files to authorized users only. *** WARNING For the initial release*, access is granted to the entire dataflow*** Row-level security is NOT currently supported RLS can be implemented at the dataset level Recommended best practice is to only grant access to Dataflows to those individuals who will be creating the datasets. General report users/consumers should continue to use Datasets, Reports and Dashboards * Future releases plan to move to more granular levels of security
Bring your own Azure Data Azure Data LakeLake Storage (generation 2) issued behind the scenes Common Data Model folder structure transparent to tenant and user No path/connection string Organizations can use their own Azure Data Lake Store Dataflow data and definition files can be used by developers Leverage Azure Data and artificial intelligence (AI) services Integrate dataflow data into internal applications
There is a process documented, with best practices, should you or your organization want to use your own Azure Data Lake. Conclusion What is a Power BI Dataflow? Why would I use it? How do I create one? How do I use it? Pro vs. Premium Comparison Security Considerations Bring your own Azure Data Lake Thank you for your time!
Joe Abbott [email protected] @GrumpyDataGuy www.pragmaticworks.com References Power BI Whitepapers (first link Power BI and Dataflows)
Self-Service data prep in Power BI Creating and using dataflows in Power BI Linked Entities (requires Power BI Premium) Bring your own Azure Data Lake