Читать книгу The Informed Company - Dave Fowler - Страница 2

Table of Contents

Оглавление

Cover

Title Page

Copyright

Dedication

About This Book Why Write This Book Who This Book Is For Who This Book Is Not For Who Wrote the Book Who Edited the Book Influences How This Book Was Written How to Read This Book

Foreword

Introduction Merging Business Context with Data Information The Four Stages of Agile Data Organization

STAGE 1: SOURCE aka Siloed Data Chapter One: Starting with Source Data Common Options for Analyzing Source Data Chapter Two: The Need to Replicate Source Data Chapter Three: Source Data Best Practices Keep a Complexity Wiki Page Snippet Dictionary Use a BI Product Double Check Results Keep Short Dashboards Design Before Building

STAGE 2: DATA LAKE aka Data Combined Chapter Four: Why Build a Data Lake? What Is a Data Lake? Reasons to Build a Data Lake Summarized Chapter Five: Choosing an Engine for the Data Lake Modern Columnar Warehouse Engines Modern Warehouse Engine Products Database Engines Recommendation Chapter Six: Extract and Load (EL) Data ETL versus ELT EL/ETL Vendors Extract Options Load Options Multiple Schemas Other Extract and Load Routes Chapter Seven: Data Lake Security Access in Central Place Permission Tiers Chapter Eight: Data Lake Maintenance Why SQL? Data Sources Performance Upgrade Snippets to Views

10  STAGE 3: DATA WAREHOUSE aka the Single Source of Truth Chapter Nine: The Power of Layers and Views Make Readable Views Layer Views on Views Start with a Single View Chapter Ten: Staging Schemas Orient to the Schemas Pick a Table and Clean It Other Staging Modeling Considerations Building on Top of Staging Schemas Chapter Eleven: Model Data with dbt Version Control Modularity and Reusability Package Management Organizing Files Macros Incremental Tables Testing Chapter Twelve: Deploy Modeling Code Branch Using Version Control Software Commit Message Test Locally Code Review Schedule Runs Chapter Thirteen: Implementing the Data Warehouse Manage Dependencies Combine Tables Within Schemas Combine Tables Across Schemas Keep the Grain Consistent Create Business Metrics Keeping Accurate History Chapter Fourteen: Managing Data Access How to Secure Sensitive Data in the Data Warehouse How to Secure Sensitive Data in a BI Tool Chapter Fifteen: Maintaining the Source of Truth Track New Metrics Deprecate Old Metrics Deprecate Old Schemas Resolve Conflicting Numbers Handling Ongoing Requests and Ongoing Feedback Updating Modeling Code Manage Access Tuning to Optimize Code Review All Modeling Maintenance Checklist

11  STAGE 4: DATA MARTS aka Data Democratized Chapter Sixteen: Data Mart Implementation Views on the Data Warehouse Segment Tables Access Update Chapter Seventeen: Data Mart Maintenance Educate Team Identifies Issues Identify New Needs Help Track Success Chapter Eighteen: Modern versus Traditional Data Stacks: What's Changed? What's Changed? Chapter Nineteen: Row‐ versus Column‐Oriented Database Row‐Oriented Databases Column‐Oriented Databases Summary Chapter Twenty: Style Guide Example Simplify Clean Naming Conventions Share It Chapter Twenty-One: Building an SST Example First Attempt—Same Tables with Prefixes Second Attempt—Operational Schema (Source Agnostic) Third Attempt—Application Separate, Other Sources Smashed Less Planning, More Implementing

12  Acknowledgments and Contributions Thank‐yous

13  Index

14  End User License Agreement

The Informed Company

Подняться наверх