SlideShare a Scribd company logo
1 of 25
Defining Your Data
Quality Project
Scope
Intro
Define your data quality project scope by following these guidelines:
Consider the main product functions and processing modes
Develop your required features
Establish project parameters
Create a budget and timeframe
Establish an evaluation strategy
Define the Main Product Functions
Data Quality product suites span a broad range of functions and in varying
combinations.
Develop an understanding of the features and how they apply to your business
in order to establish what will work for you
The functions listed below are standard in a Data Quality package, and are
listed in order of process flow
Select Main Product Functions
 Standardization
 General ‘cleansing’ functions
 Fixing misspellings, inconsistencies, transpositions and the like
 Moving data across columns, adding state names, zip codes,
titles in places where they are missing
 Address Validation (Verification)
 Matching contact data to standard Postal Address Files (PAF) or
USPS and NCOA Data to validate and update addresses
 CASS Standardization
Select Main Product Functions
 Data Enrichment
 Expanding and enhancing your existing contact data with
additional datasets.
 The variety of datasets includes names data, date of birth, length
of residency, phone and fax numbers, SIC codes, geocoding data
and more.
 Matching/Deduplication
 Matching records within a file or between multiple files for
merging and purging duplicate records, identifying your best
customers or a multiplicity of other reasons.
 A simple count of duplicates, suppressions or records matched is
essentially meaningless – it is the number of true and false
matches that is significant.
Select Main Product Functions
 Record – Linking/ Single Customer View
 ‘Link’ specific records to one another, specifically for the purpose
of creating a single master record (or golden record)
 Master record includes all relevant data for a specific contact
including email preferences, transactions and customer service
history
 Generates the elusive Single Customer View (or 360 Degree
View)
Consider Main Processing Modes
Not all vendors will handle all applications. Consider what processing modes
are critical to your data quality
 Batch (Existing Data)
 Often referred to as “batch data cleansing”
 Cleansing of data already in your database
 Curative measure
 Batch (Load Data)
 Batch processing is also used to match data across files
 Preventative measure
 Real Time (Interactive)
 Tools that work interactively to warn the user entering data that the
information already exists, or if the information is invalid
 Preventative measure
 Real Time (Firewall)
 New records are captured without the user correcting any of the info
 The record is validated and corrected in the background, or logged for
manual attention by someone later
 Preventative Measure
Consider Main Processing Modes
With this information background, the current objective is to identify your
ideal solution based on the business objectives and data quality functions
you need to achieve your goals
Think ahead to your anticipated needs, granularly and globally
Consider larger data projects that may impact the needs of the tools that
you invest in
Processing Needs:
Develop Your Required Features
Here are other items to consider when developing your list of Required
Features:
• Some companies use different terminology for the same feature.
• Some data quality tools are modular and will offer features or
sets of features in individual components with different price
points and installations.
• Consider where a new or improved application or process would
be the best direction to go in
Features Worksheet
Standardization Features Need Want
Correct poorly structured and non – standard records
Identify Foreign Records
Flag inappropriate data in name and address
Flag garbage or incomplete data
Intelligent casing
Salutation generation from names
Address Standardization
Address Verification Capabilities Need Want
Integration of addresses against Postal Address Files/U
Control over updates to postcode/address
Update record with mail format address
Split address completely into component parts
Features Worksheet
Data Enrichment Capabilities Need Want
Append geocoding data
Append consumer data
Append business data
Record – Linking Features Need Want
Grouping/ Linking of matches
Master record identification
Retain information from duplicate records
Reassign orphaned records
Real-time view across databases for inquiry and data
capture
Features Worksheet
Matching and Deduplication Features Need Want
Fuzzy matching
Grading of matches
Tuning of matching rules
Ability to automate matching
Manual review of matches
Multiple level of matches in one pass
Matching on non – standard data
Matching allows for missing and inconsistent data
Effective matching out – of – the – box
Customizable matching reports
Matching files in different formats
Processing Modes Worksheet
Batch (Existing Data) Need Want
Integrated into your database to clean up existing data
Timely and efficient single file matching
Timely and efficient address verification
Batch (Load Data) Need Want
Load new batches of data
Easy to load data in different formats
Rapid matching of small batches of new data against a large
master file
Automatic scheduled operation of solution
Production of standard management and exception reports
Processing Modes Worksheet
Real – Time (Interactive) Need Want
Integrated into your database at point of capture
Real time feedback on data errors
Rapid address entry using Postcode
Intelligent inquiry to find exact matches
Real – Time (Firewall) Need Want
Run on individual records entering the database
Additional Notes:
Establish Project Parameters
Don’t ignore the need for strategies and guidelines to keep both your
vendors and your organization on track
Be flexible as you go through the evaluation process, especially when it
comes to moving parts such as budget and timeframe
Having a plan and some goal parameters in place will be priceless and may
mean the difference between getting the project off the ground or letting
inertia win out
Anticipated Budget (Potential Savings)
Ballpark the potential cost savings of improving your data
Vendors can help with data analysis
Typically there are as many as 10% duplicates in a database. Assume you
have 5% duplicates in your system, start from there
Try to calculate money wasted on advertising, resources needed to handle
customer shipping complaints, or how much more money you would make if
you had more control over marketing
Take a look at the high and low end of vendors you have created on your
shortlist
Rather than call a data quality company and ask for a price, develop your list
and create your price range based on the functions and features you need
Timeframe
At the beginning stages this will be more of an awareness technique rather
than a goal, and will evolve over the course of your evaluation
Seek input from vendors and your internal team to keep a realistic approach
If there is an internal goal that you have set, plan your time by working
backwards from that date
Budget time for all key steps including:
Internal Planning
Searching for vendors
Initial review
Demoing the short list
Internal Decision making
Negotiation
Implementation and Training
Review and Approval Team
Be aware of all of the necessary influencers, decision-makers and budget
approvers that need to be a part of this process
By making the vendor aware of these key departments early on, they will be
able to work with you through the approval process by:
Requesting presentations to all influencers on the team
Making demo software available to all potential users
Helping you with documentation to make the case for a C – level
executive
Establish Your Evaluation Strategy
Evaluate the applications selected.
Knowing your strategy in advance will help you communicate expectations
and guidelines to your vendors, and inform your internal staff and approvals
team so that the process stays on track
Some considerations for this strategy are below
To RFP or Not to RFP
Distribute a Request for Proposals (RFP/RFQ) to a list of vendors, to help with
your evaluation
Submitting a formal bid obligates you to perform a completely fair, balanced
and unbiased evaluation that follows a set of rules and guidelines set out in
the bid
Referrals, the unexpected and sheer gut instinct do not get to play a part,
which ultimately may mean that you may not get to choose your preferred
vendor
Demo Data or Real Data
This will likely be the first question asked of you when making contact with
vendors
Evaluate a solution on your own data.
Sometimes this is not possible right away, or even necessary. You may
have such basic needs that preparing your own data is not necessary
Prepare your sample data accordingly to do a thorough and efficient test of
the software
Who is Driving the Ship
Determine whether the project will be spearheaded by the business or
technology department before starting your evaluation
E.g. If you are from a business department but, after identifying your
requirements, decide that the organization is likely to take an integrated
approach, it may be best to hand off the lead role to a technology
representative (or vice versa)
Gather the Appropriate Documentation and Files
Documents that you should gather before and during this process:
Request for Proposal (if appropriate) using the functional and feature
requirements outlined here
Required Features List (with columns outlined for your individual
shortlist vendors)
Demo data
Review/Approval forms for the members of your team
Budget Spreadsheet
Keep These Things In Mind
Review the list of main product features and processing modes in order to
interpret what functions you need for your data
Establish a timeframe and budget for your project, with input from vendors
and the internal team
Remember to keep vendors and your internal team on the same track in
order to help the process run smoothly
Contact helpIT
US HEADQUARTERS
(The Americas, Australia, New Zealand)
helpIT systems inc.
51 Bedford Rd.
Suite 9
Katonah, NY 10536
United States
US Toll Free: 866.332.7132
US Local: 914.600.7240
Australia: +61 280363191
Fax: 914.232.1429
Email: sales.us@helpIT.com
TECHNICAL SUPPORT
Support: 866.matchIT
Email: support.us@helpIT.com
EUROPEAN HEADQUARTERS
(UK, Europe, Asia)
helpIT systems ltd.
15-17 The Crescent.
LEATHERHEAD
Surrey
KT22 8DY
United Kingdom
Tel: +44 (0) 1372 360070
Fax: +44 (0) 1372 360081
Email: sales.uk@helpIT.com
TECHNICAL SUPPORT
Support: +44 (0) 1372 225904
Email: support@helpIT.com
Registered in England
Registered Office: as above
Company No. 02007292
VAT No. 564228340

More Related Content

Recently uploaded

8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCRashishs7044
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckHajeJanKamps
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Riya Pathan
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCRashishs7044
 
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCRashishs7044
 
Appkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptxAppkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptxappkodes
 
PSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationPSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationAnamaria Contreras
 
TriStar Gold Corporate Presentation - April 2024
TriStar Gold Corporate Presentation - April 2024TriStar Gold Corporate Presentation - April 2024
TriStar Gold Corporate Presentation - April 2024Adnet Communications
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Kirill Klimov
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Seta Wicaksana
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesKeppelCorporation
 
Buy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy Verified Accounts
 
Chapter 9 PPT 4th edition.pdf internal audit
Chapter 9 PPT 4th edition.pdf internal auditChapter 9 PPT 4th edition.pdf internal audit
Chapter 9 PPT 4th edition.pdf internal auditNhtLNguyn9
 
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City GurgaonCall Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaoncallgirls2057
 
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!Doge Mining Website
 
APRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfAPRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfRbc Rbcua
 

Recently uploaded (20)

Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)
 
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
 
No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...
No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...
No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR
 
Call Us ➥9319373153▻Call Girls In North Goa
Call Us ➥9319373153▻Call Girls In North GoaCall Us ➥9319373153▻Call Girls In North Goa
Call Us ➥9319373153▻Call Girls In North Goa
 
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
 
Appkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptxAppkodes Tinder Clone Script with Customisable Solutions.pptx
Appkodes Tinder Clone Script with Customisable Solutions.pptx
 
PSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationPSCC - Capability Statement Presentation
PSCC - Capability Statement Presentation
 
TriStar Gold Corporate Presentation - April 2024
TriStar Gold Corporate Presentation - April 2024TriStar Gold Corporate Presentation - April 2024
TriStar Gold Corporate Presentation - April 2024
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation Slides
 
Buy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail Accounts
 
Corporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information TechnologyCorporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information Technology
 
Chapter 9 PPT 4th edition.pdf internal audit
Chapter 9 PPT 4th edition.pdf internal auditChapter 9 PPT 4th edition.pdf internal audit
Chapter 9 PPT 4th edition.pdf internal audit
 
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City GurgaonCall Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaon
 
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
 
APRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfAPRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdf
 

Featured

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationErica Santiago
 

Featured (20)

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
 

Defining your Data Quality Project Scope

  • 2. Intro Define your data quality project scope by following these guidelines: Consider the main product functions and processing modes Develop your required features Establish project parameters Create a budget and timeframe Establish an evaluation strategy
  • 3. Define the Main Product Functions Data Quality product suites span a broad range of functions and in varying combinations. Develop an understanding of the features and how they apply to your business in order to establish what will work for you The functions listed below are standard in a Data Quality package, and are listed in order of process flow
  • 4. Select Main Product Functions  Standardization  General ‘cleansing’ functions  Fixing misspellings, inconsistencies, transpositions and the like  Moving data across columns, adding state names, zip codes, titles in places where they are missing  Address Validation (Verification)  Matching contact data to standard Postal Address Files (PAF) or USPS and NCOA Data to validate and update addresses  CASS Standardization
  • 5. Select Main Product Functions  Data Enrichment  Expanding and enhancing your existing contact data with additional datasets.  The variety of datasets includes names data, date of birth, length of residency, phone and fax numbers, SIC codes, geocoding data and more.  Matching/Deduplication  Matching records within a file or between multiple files for merging and purging duplicate records, identifying your best customers or a multiplicity of other reasons.  A simple count of duplicates, suppressions or records matched is essentially meaningless – it is the number of true and false matches that is significant.
  • 6. Select Main Product Functions  Record – Linking/ Single Customer View  ‘Link’ specific records to one another, specifically for the purpose of creating a single master record (or golden record)  Master record includes all relevant data for a specific contact including email preferences, transactions and customer service history  Generates the elusive Single Customer View (or 360 Degree View)
  • 7. Consider Main Processing Modes Not all vendors will handle all applications. Consider what processing modes are critical to your data quality  Batch (Existing Data)  Often referred to as “batch data cleansing”  Cleansing of data already in your database  Curative measure  Batch (Load Data)  Batch processing is also used to match data across files  Preventative measure  Real Time (Interactive)  Tools that work interactively to warn the user entering data that the information already exists, or if the information is invalid  Preventative measure  Real Time (Firewall)  New records are captured without the user correcting any of the info  The record is validated and corrected in the background, or logged for manual attention by someone later  Preventative Measure
  • 8. Consider Main Processing Modes With this information background, the current objective is to identify your ideal solution based on the business objectives and data quality functions you need to achieve your goals Think ahead to your anticipated needs, granularly and globally Consider larger data projects that may impact the needs of the tools that you invest in Processing Needs:
  • 9. Develop Your Required Features Here are other items to consider when developing your list of Required Features: • Some companies use different terminology for the same feature. • Some data quality tools are modular and will offer features or sets of features in individual components with different price points and installations. • Consider where a new or improved application or process would be the best direction to go in
  • 10. Features Worksheet Standardization Features Need Want Correct poorly structured and non – standard records Identify Foreign Records Flag inappropriate data in name and address Flag garbage or incomplete data Intelligent casing Salutation generation from names Address Standardization Address Verification Capabilities Need Want Integration of addresses against Postal Address Files/U Control over updates to postcode/address Update record with mail format address Split address completely into component parts
  • 11. Features Worksheet Data Enrichment Capabilities Need Want Append geocoding data Append consumer data Append business data Record – Linking Features Need Want Grouping/ Linking of matches Master record identification Retain information from duplicate records Reassign orphaned records Real-time view across databases for inquiry and data capture
  • 12. Features Worksheet Matching and Deduplication Features Need Want Fuzzy matching Grading of matches Tuning of matching rules Ability to automate matching Manual review of matches Multiple level of matches in one pass Matching on non – standard data Matching allows for missing and inconsistent data Effective matching out – of – the – box Customizable matching reports Matching files in different formats
  • 13. Processing Modes Worksheet Batch (Existing Data) Need Want Integrated into your database to clean up existing data Timely and efficient single file matching Timely and efficient address verification Batch (Load Data) Need Want Load new batches of data Easy to load data in different formats Rapid matching of small batches of new data against a large master file Automatic scheduled operation of solution Production of standard management and exception reports
  • 14. Processing Modes Worksheet Real – Time (Interactive) Need Want Integrated into your database at point of capture Real time feedback on data errors Rapid address entry using Postcode Intelligent inquiry to find exact matches Real – Time (Firewall) Need Want Run on individual records entering the database Additional Notes:
  • 15. Establish Project Parameters Don’t ignore the need for strategies and guidelines to keep both your vendors and your organization on track Be flexible as you go through the evaluation process, especially when it comes to moving parts such as budget and timeframe Having a plan and some goal parameters in place will be priceless and may mean the difference between getting the project off the ground or letting inertia win out
  • 16. Anticipated Budget (Potential Savings) Ballpark the potential cost savings of improving your data Vendors can help with data analysis Typically there are as many as 10% duplicates in a database. Assume you have 5% duplicates in your system, start from there Try to calculate money wasted on advertising, resources needed to handle customer shipping complaints, or how much more money you would make if you had more control over marketing Take a look at the high and low end of vendors you have created on your shortlist Rather than call a data quality company and ask for a price, develop your list and create your price range based on the functions and features you need
  • 17. Timeframe At the beginning stages this will be more of an awareness technique rather than a goal, and will evolve over the course of your evaluation Seek input from vendors and your internal team to keep a realistic approach If there is an internal goal that you have set, plan your time by working backwards from that date Budget time for all key steps including: Internal Planning Searching for vendors Initial review Demoing the short list Internal Decision making Negotiation Implementation and Training
  • 18. Review and Approval Team Be aware of all of the necessary influencers, decision-makers and budget approvers that need to be a part of this process By making the vendor aware of these key departments early on, they will be able to work with you through the approval process by: Requesting presentations to all influencers on the team Making demo software available to all potential users Helping you with documentation to make the case for a C – level executive
  • 19. Establish Your Evaluation Strategy Evaluate the applications selected. Knowing your strategy in advance will help you communicate expectations and guidelines to your vendors, and inform your internal staff and approvals team so that the process stays on track Some considerations for this strategy are below
  • 20. To RFP or Not to RFP Distribute a Request for Proposals (RFP/RFQ) to a list of vendors, to help with your evaluation Submitting a formal bid obligates you to perform a completely fair, balanced and unbiased evaluation that follows a set of rules and guidelines set out in the bid Referrals, the unexpected and sheer gut instinct do not get to play a part, which ultimately may mean that you may not get to choose your preferred vendor
  • 21. Demo Data or Real Data This will likely be the first question asked of you when making contact with vendors Evaluate a solution on your own data. Sometimes this is not possible right away, or even necessary. You may have such basic needs that preparing your own data is not necessary Prepare your sample data accordingly to do a thorough and efficient test of the software
  • 22. Who is Driving the Ship Determine whether the project will be spearheaded by the business or technology department before starting your evaluation E.g. If you are from a business department but, after identifying your requirements, decide that the organization is likely to take an integrated approach, it may be best to hand off the lead role to a technology representative (or vice versa)
  • 23. Gather the Appropriate Documentation and Files Documents that you should gather before and during this process: Request for Proposal (if appropriate) using the functional and feature requirements outlined here Required Features List (with columns outlined for your individual shortlist vendors) Demo data Review/Approval forms for the members of your team Budget Spreadsheet
  • 24. Keep These Things In Mind Review the list of main product features and processing modes in order to interpret what functions you need for your data Establish a timeframe and budget for your project, with input from vendors and the internal team Remember to keep vendors and your internal team on the same track in order to help the process run smoothly
  • 25. Contact helpIT US HEADQUARTERS (The Americas, Australia, New Zealand) helpIT systems inc. 51 Bedford Rd. Suite 9 Katonah, NY 10536 United States US Toll Free: 866.332.7132 US Local: 914.600.7240 Australia: +61 280363191 Fax: 914.232.1429 Email: sales.us@helpIT.com TECHNICAL SUPPORT Support: 866.matchIT Email: support.us@helpIT.com EUROPEAN HEADQUARTERS (UK, Europe, Asia) helpIT systems ltd. 15-17 The Crescent. LEATHERHEAD Surrey KT22 8DY United Kingdom Tel: +44 (0) 1372 360070 Fax: +44 (0) 1372 360081 Email: sales.uk@helpIT.com TECHNICAL SUPPORT Support: +44 (0) 1372 225904 Email: support@helpIT.com Registered in England Registered Office: as above Company No. 02007292 VAT No. 564228340