书目名称 | Data Cleaning | 编辑 | Venkatesh Ganti,Anish Das Sarma | 视频video | | 丛书名称 | Synthesis Lectures on Data Management | 图书封面 |  | 描述 | Data warehouses consolidate various activities of a business and often form the backbone for generating reports that support important business decisions. Errors in data tend to creep in for a variety of reasons. Some of these reasons include errors during input data collection and errors while merging data collected independently across different databases. These errors in data warehouses often result in erroneous upstream reports, and could impact business decisions negatively. Therefore, one of the critical challenges while maintaining large data warehouses is that of ensuring the quality of data in the data warehouse remains high. The process of maintaining high data quality is commonly referred to as data cleaning. In this book, we first discuss the goals of data cleaning. Often, the goals of data cleaning are not well defined and could mean different solutions in different scenarios. Toward clarifying these goals, we abstract out a common set of data cleaning tasks that often need to be addressed. This abstraction allows us to develop solutions for these common data cleaning tasks. We then discuss a few popular approaches for developing such solutions. In particular, we focus | 出版日期 | Book 2013 | 版次 | 1 | doi | https://doi.org/10.1007/978-3-031-01897-8 | isbn_softcover | 978-3-031-00769-9 | isbn_ebook | 978-3-031-01897-8Series ISSN 2153-5418 Series E-ISSN 2153-5426 | issn_series | 2153-5418 | copyright | Springer Nature Switzerland AG 2013 |
The information of publication is updating
|
|