男生如果明白 发表于 2025-3-25 04:06:45
Interestingness Measures for Exploratory Data Analysis: a Survey EDA were recently proposed. They all rely on interestingness measures to score the importance of insights. This paper surveys and categorizes the different interestingness measures proposed in the literature for approaches aiming at automating EDA. The lessons learned from this survey allow to poinJEER 发表于 2025-3-25 09:25:02
http://reply.papertrans.cn/67/6697/669665/669665_22.png庄严 发表于 2025-3-25 13:21:28
http://reply.papertrans.cn/67/6697/669665/669665_23.pngPSA-velocity 发表于 2025-3-25 17:54:59
Estimating , with , and Machine Learningally for large time series. We propose a technique for the approximate computation of . that uses the . representation of the time series to quickly estimate the Nearest Neighbor (NN) distance of each subsequence, and then applies a Machine Learning model to correct the accuracy loss incurred. Our mDysarthria 发表于 2025-3-25 22:36:30
Entity Matching with Large Language Models as Weak and Strong Labellersile acting in a zero shot manner, thus reducing the need for labelled training data. However, the use of the most capable LLMs (e.g. GPT-4) comes at a strongly prohibitive cost, whilst weaker LLMs (e.g. GPT-3) are significantly cheaper yet still provide reasonable performance without training data.Firefly 发表于 2025-3-26 00:57:46
LLMClean: Context-Aware Tabular Data Cleaning via LLM-Generated OFDsenting data quality. Nevertheless, crafting these context models is a demanding task, both in terms of resources and expertise, often necessitating specialized knowledge from domain experts. This paper introduces an innovative approach, called LLMClean, for the automated generation of context modelsVulnerable 发表于 2025-3-26 07:17:00
http://reply.papertrans.cn/67/6697/669665/669665_27.pngreaching 发表于 2025-3-26 11:17:55
http://reply.papertrans.cn/67/6697/669665/669665_28.pngKernel 发表于 2025-3-26 15:31:41
Construction of Open Data Sources for Data Interoperability in Brazilian Health Information Systemses conducted to process and link such data sources are based on corpora and databases in the English language. This hinders an adaptation to non-English speaking countries, not only because of the language but also due to the lack of authoritative integrated bases in these countries - such as Brazil充满装饰 发表于 2025-3-26 17:58:29
Brazilian Political Study with Topics Analysis and Complex Networkseputies. The present work expands the social analysis of this system, usually analyzed with complex networks, to also consider the ideological aspect of the house by conducting a topic analysis to generate thematic complex networks that are analyzed to uncover how the theme of a voting proposition a