Cody’s Data Cleaning Techniques Using SAS
Author: Ron Cody
Publisher: SAS Press
The starting point for many, if not most, SAS projects is data cleaning — identifying and resolving discrepancies and anomolies in a set of data so that it can be analyzed successfully. This book presents a tested set of SAS programming techniques for validating, verifying, and standardizing all kinds of data values. Most of the book consists of examples, drawn from the field of clinical medical research, that demonstrate typical data problems and the techniques for solving them.
2 Paperback
Year: 2008
ISBN: 978-1-59994-659-7
Pages: 264
Publisher’s list price: 42.95
Contents
- Introduction
- 1. Checking Values of Character Variables
- 2. Checking Values of Numeric Variables
- 3. Checking for Missing Values
- 4. Working with Dates
- 5. Looking for Duplicates and “n” Observations per Subject
- 6. Working with Multiple Files
- 7. Double Entry and Verification (PROC COMPARE)
- 8. Some PROC SQL Solutions to Data Cleaning
- 9. Correcting Errors
- 10. Creating Integrity Constraints and Audit Trails
- 11. DataFlux and dfPower Studio
- Appendix. Listing of Raw Data Files and SAS Programs
- Index
1 Paperback
Title: Cody’s Data Cleaning Techniques Using SAS Software
Year: 1999
ISBN: 1-58025-600-7
Pages: 226
Publisher’s list price: 37.95
Contents
- Introduction
- 1. Checking Values of Character Variables
- 2. Checking Values of Numeric Variables
- 3. Checking for Missing Values
- 4. Working with Dates
- 5. Looking for Duplicates and “n” Observations per Subject
- 6. Working with Multiple Files
- 7. Double Entry and Verification (PROC COMPARE)
- 8. Some SQL Solutions to Data Cleaning
- 9. Using Validation Data Sets
- Appendix. Listing of Raw Data Files and SAS Programs
- Index
From the back cover
The key to ensuring accurate data is having clean data. This book develops and describes data cleaning programs and macros. You can use many of the programs and macros that are provided as is, or you can modify them for your own special data cleaning tasks. Ron has carefully explained and documented each of the programs and macros, thus providing you with SAS programming instruction on an intermediate to advanced level. Topics presented include
- validation checks on character data, numeric data, missing values, and date values
- searching for duplicate records
- working with multiple files
- double entry and verification using the COMPARE procedures
- SQL solutions and using validation data sets.
Written in Ron’s signature informal, tutorial style, this book gives anyone who manages data thoroughly documented, step-by-step instructions for the development of data cleaning programs and macros.
Review
Cody±s Data Cleaning Techniques Using SAS Software is the perfect solution for anyone faced with the problems of dealing with messy data. This is an easy-to-follow, very comprehensive exploration of the techniques needed to get data in shape for analysis and reporting. It is an ideal book for the beginning SAS user, loaded with many clear examples of different types of data problems and step-by-step explanations of the solutions.
Intermediate and advanced SAS users will also find it quite beneficial. It provides a solid introduction to new data cleaning features available with SAS versions 7 and 8 and is an ideal refresher for often-overlooked, simple solutions. This book is a “must have” addition to the SAS library of anyone who prepares and cleans data for analysis and reporting.