Data Lineage: Planning, Documenting, Applying
Speaker: Irina Steenbeek
18-19 October 2022
All public courses are available as in-house training. Contact us for more information.
Overview
Data lineage has become a daily demand. Many companies consider the opportunities to initiate a data lineage business case. However, the implementation of data lineage brings a lot of challenges.
Data lineage remains an abstract, unknown concept for many stakeholders. The data management community doesn’t have an aligned definition of it. Therefore, each company should start a data lineage initiative with the development of a data lineage metamodel.
The documentation is complex and resource-consuming. In any case, the documentation of data lineage requires much effort and many resources. To make this initiative successful, an organization should clearly identify its requirements regarding data lineage, assess its resources, scope and plan the initiative correspondingly. The correct scoping and realistic planning are one of the key success factors. Still, the implementation method and required software applications depend on the chosen data lineage model and scope.
Applying the outputs of data lineage remains a challenge in “business as usual” operations. At the beginning of the data lineage initiative, many stakeholders are unfamiliar with the concept. Their initial expectations often do not match the real outputs. Furthermore, the use of data lineage requires some technical skills and knowledge. Therefore, an organization should have a clear vision about required data lineage application areas.
In a 2-day Masterclass, we aim to:
- Provide the definition and model of data lineage
- Demonstrate best practices in data lineage documentation
- Discuss key business areas of data lineage usage
All public courses are available as in-house training. Contact us for more information.
Learning Objectives
During the Masterclass, participants will become familiar with:
- The concept of data lineage
- Relationships between data lineage and other similar concepts
- The metamodel of data lineage
- The method to identify and prioritize business drivers for data lineage
- Data lineage stakeholder analysis
- Methods to scope a data lineage initiative
- Accountabilities regarding data lineage
- Methods and approaches to implement data lineage
- Various data lineage solutions and methods of their evaluation
- Key methods of data lineage documentation
- Key areas and challenges of data lineage usage
After attending the Masterclass, participants will be able to:
- Design the data lineage metamodel that fits a company’s needs and resources
- Assess the readiness of the company for the data lineage initiative
- Prepare requirements for data lineage
- Scope the data lineage initiative
- Analyze various methods and approaches to performing data lineage documentation
- Analyze the required data lineage solution
- Assess risks and success factors
Course Outline
The Masterclass includes short lectures, 23 exercises, and Q&A discussions.
Topic 1: Introduction
- Trends and challenges with data lineage
- Data lineage trends
- Key challenges with data lineage implementation
- Introduction into a data lineage concept
- Data lineage definitions
- An example of data lineage
- Introduction into a method to build a data lineage business case
- Three key phases in building a data lineage business case
Topic 2: Planning a data lineage initiative
- Identify business drivers
- Internal and external business drivers
- Critical business drivers
- Identify key stakeholders and sponsors
- The definitions of a “business stakeholder”
- Stakeholder analysis
- Different stakeholder types
- Sponsors for a data lineage initiative
- Define a metamodel of data lineage: key concepts
- Data and metadata
- Metadata classification
- Data lineage and other comparable concepts
- Relationships between a data “lifecycle,” “chain,” and “lineage” concepts
- Define a metamodel of data lineage
- Components of data lineage (DL) at various abstraction levels
- DL components at the business level
- DL components at the conceptual / semantic level
- DL components at the logical / solution level
- DL components at the physical level
- A metamodel of data lineage
- Assess a company’s readiness for a data lineage initiative
- A capability model of data management
- A “heat map” method to assess a company’s readiness
- DL components at the conceptual / semantic level
- Scope a data lineage initiative
- Mapping between requirements and a data lineage metamodel
- Various factors to scope of a data lineage initiative
- Assess risks and develop mitigation actions
- Key risk factors for a data lineage initiative
Topic 3: Implementing data lineage
- Develop data lineage requirements
- Different types of data lineage
- General requirements for data lineage
- Requirements for vertical data lineage
- Requirements for horizontal data lineage
- Choose an approach and method of documentation
- The “enterprise” coverage
- The method of documentation
- The “length,” “depth,” and a set of data lineage components and objects
- The direction of documentation
- A program/project management approach
- Metadata management maturity
- Define roles and accountabilities
- Data management roles
- Factors that influence the roles’ design
- Different types of data management roles
- RACI matrix for a data lineage project
- Choose appropriate tools
- Key steps in searching for a data lineage solution
- Various types of data lineage solutions
- Document data lineage
- Two types of data lineage documentation
- Basics of descriptive data lineage
- Basics of automated data lineage
Topic 4: Exploring data lineage
- Build analytical tools and reports
- The key goals of data lineage analytics
- Verify metadata quality
- Different metadata control types
- Monitoring, versioning control, and archiving requirements
- Use data lineage for various data management initiatives:
- Metadata management
- Critical data
- Data quality checks
- Master and reference data
- Data DevOps and migration projects
- Driver-based planning
- Setting up a data management framework
Who It's For
- Data management and business professionals can get ideas about data lineage and its application areas.
- Professionals with a technical background may gain a better understanding of business needs and requirements for data lineage.
- Project management professionals can become familiar with the best practices of data lineage implementation.
Speaker
Irina Steenbeek
Managing Director
Data Crossroads
Dr. Irina Steenbeek is a data management practitioner with over 11 years of experience. The key areas of her professional expertise are the data management maturity assessment, implementation of data management frameworks, and data lineage. Irina has practical experience in software implementation such as ERP and DWH/BI, management consultation, financial and business controls, and data science. Through the years, she has worked for global institutions as well as large- and medium-sized organizations in different sectors, including but not limited to financial institutions, professional services, and IT companies. In 2016, she founded Data Crossroads – a training and coaching services enterprise in data management. Data Crossroads focuses on assisting companies in improving their decision-making by setting up an effective data management framework that fits-for-purpose the company business goals and resources. Irina is a strong believer that the success of data management initiatives is based on the combination of a pragmatic approach and clear and transparent methodology. She has shared her approach and implementation experience by publishing The “Orange” Model of Data Management, The Data Management Toolkit, The Data Management Cookbook, and Data Lineage from a Business Perspective. She is also the author of various white papers and articles on the topic of data management.
Fees
- 2 days
- £895
- £895 + VAT = £1074 (EARLY BIRD PRICE IF YOU BOOK BEFORE 20TH SEPTEMBER)
- 2 days
- £995
- £995 + VAT = £1194 (PRICE AFTER 20TH SEPTEMBER)
Group Booking Discounts
Delegates | |
---|---|
2 - 3 Delegates | 10% discount |
4 - 5 Delegates | 20% discount |
6+ Delegates | 25% discount |