Titlebook: Data Profiling; Ziawasch Abedjan, Lukasz Golab, Thorsten Papenbrock; Book 2019; Springer Nature Switzerland AG 2019

Posted on 2025-3-21 16:25:52
Title: Data Profiling
Editors: Ziawasch Abedjan, Lukasz Golab, Thorsten Papenbrock
Series: Synthesis Lectures on Data Management
Description: Data profiling refers to the activity of collecting data about data, i.e., metadata. Most IT professionals and researchers who work with data have engaged in data profiling, at least informally, to understand and explore an unfamiliar dataset or to determine whether a new dataset is appropriate for a particular task at hand. Data profiling results are also important in a variety of other situations, including query optimization, data integration, and data cleaning. Simple metadata are statistics, such as the number of rows and columns, schema and datatype information, the number of distinct values, statistical value distributions, and the number of null or empty values in each column. More complex types of metadata are statements about multiple columns and their correlation, such as candidate keys, functional dependencies, and other types of dependencies. This book provides a classification of the various types of profilable metadata, discusses popular data profiling tasks, and surveys state-of-the-art profiling algorithms. While most of the book focuses on tasks and algorithms for relational data profiling, we also briefly discuss systems and techniques for profiling non-relati
Publication date: Book 2019
Edition: 1
DOI: https://doi.org/10.1007/978-3-031-01865-7
ISBN (softcover): 978-3-031-00737-8
ISBN (ebook): 978-3-031-01865-7
Series ISSN: 2153-5418
Series E-ISSN: 2153-5426
Copyright: Springer Nature Switzerland AG 2019
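
The simple metadata listed in the description (row and column counts, datatype information, distinct values, value distributions, and null or empty counts) can be illustrated with a minimal sketch. This is not taken from the book: the toy table, the function name `profile_columns`, and the choice to count both `None` and the empty string as "null or empty" are assumptions made for illustration.

```python
from collections import Counter

# Hypothetical toy table: a list of rows, each a dict from column name to value.
rows = [
    {"id": 1, "city": "Berlin", "zip": "10115"},
    {"id": 2, "city": "Berlin", "zip": "10115"},
    {"id": 3, "city": "Waterloo", "zip": None},
    {"id": 4, "city": "", "zip": "N2L"},
]


def profile_columns(rows):
    """Collect simple single-column metadata for a list of dict rows."""
    columns = list(rows[0].keys()) if rows else []
    profile = {"num_rows": len(rows), "num_columns": len(columns), "columns": {}}
    for col in columns:
        values = [row.get(col) for row in rows]
        # Assumption: None and the empty string both count as "null or empty".
        present = [v for v in values if v is not None and v != ""]
        profile["columns"][col] = {
            "nulls_or_empty": len(values) - len(present),
            "distinct": len(set(present)),
            # Observed Python types stand in for datatype information.
            "types": sorted({type(v).__name__ for v in present}),
            # Frequency of each non-null value (the value distribution).
            "value_counts": dict(Counter(present)),
        }
    return profile


profile = profile_columns(rows)
```

Each entry in `profile["columns"]` then holds the per-column statistics, for example `profile["columns"]["city"]["distinct"]` for the number of distinct city values.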

Posted on 2025-3-22 06:13:16
Discovering Metadata
… science and data analytics, and with the realization that business insight may be extracted from data, has brought many datasets into organizations’ data lakes and data reservoirs. Data profiling helps understand and prepare data for subsequent cleansing, integration, and analysis.
Posted on 2025-3-22 12:08:54
Data Profiling Tasks
… individual columns, those which identify dependencies across columns, and those which examine non-relational data such as trees, graphs or text. The classes are explained in the following subsections, where we also discuss the relationship between data profiling and data mining.
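
As a rough illustration of the second class, tasks that identify dependencies across columns, the sketch below checks column combinations for uniqueness (candidate keys) and tests whether a functional dependency holds. It is a naive brute-force version written for this thread, not one of the algorithms surveyed in the book. The toy rows and the helper names `is_unique`, `holds_fd`, and `minimal_unique_sets` are hypothetical.

```python
from itertools import combinations

# Hypothetical toy relation; columns are addressed by index into `header`.
header = ("id", "city", "country")
rows = [
    ("1", "Berlin", "DE"),
    ("2", "Berlin", "DE"),
    ("3", "Waterloo", "CA"),
    ("4", "Potsdam", "DE"),
]


def is_unique(rows, cols):
    """True if the column combination `cols` contains no duplicate value tuples."""
    seen = set()
    for row in rows:
        key = tuple(row[c] for c in cols)
        if key in seen:
            return False
        seen.add(key)
    return True


def holds_fd(rows, lhs, rhs):
    """True if the functional dependency lhs -> rhs holds on `rows`."""
    mapping = {}
    for row in rows:
        key = tuple(row[c] for c in lhs)
        # setdefault stores the first rhs value seen for this lhs value;
        # any later row with a different rhs value violates the dependency.
        if mapping.setdefault(key, row[rhs]) != row[rhs]:
            return False
    return True


def minimal_unique_sets(rows, num_cols):
    """Brute-force search for minimal unique column combinations."""
    found = []
    for size in range(1, num_cols + 1):
        for cols in combinations(range(num_cols), size):
            # Skip supersets of combinations already known to be unique.
            if any(set(f) <= set(cols) for f in found):
                continue
            if is_unique(rows, cols):
                found.append(cols)
    return found


keys = minimal_unique_sets(rows, len(header))
city_determines_country = holds_fd(rows, (1,), 2)
country_determines_city = holds_fd(rows, (2,), 1)
```

The brute-force search visits every column combination, so it is exponential in the number of columns and only suitable for small examples like this one.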
Posted on 2025-3-22 13:15:48
Regulation? — or Discrimination?
… semi-structured data such as XML and RDF and non-structured data such as text. In this chapter, we describe two types of solutions: those which apply traditional data profiling algorithms to new types of data and those which develop new approaches to profiling non-relational data.