是贪求 posted on 2025-3-23 10:45:32

… 59 submissions. They cover topics such as: data compression; information retrieval; string algorithms; algorithms; computational biology; indexing and compression; and compressed data structures.
ISBN 978-3-030-32685-2, 978-3-030-32686-9. Series ISSN 0302-9743. Series E-ISSN 1611-3349.

思想上升 posted on 2025-3-23 16:39:41

Approximation Ratios of ,, , and , on Unary Strings
… and .) achieve impressive practical compression results, but the recursive character of those algorithms makes it hard to achieve strong theoretical results. To this end, this paper studies the approximation ratio of those algorithms for unary input strings, which is strongly related to the field of

constitute posted on 2025-3-23 20:20:36

http://reply.papertrans.cn/88/8798/879720/879720_13.png

食道 posted on 2025-3-23 22:41:03

Rpair: Rescaling RePair with Rsync
… computation over the data without decompressing it. In the best case such a scheme takes a dataset so big that it must be stored on disk and shrinks it enough that it can be stored and processed in internal memory. Even then, however, the scheme is essentially useless unless it can be built on the ori

SLING posted on 2025-3-24 02:44:52

Position Bias Estimation for Unbiased Learning-to-Rank in eCommerce Search
… from learning-to-rank models. The method takes two steps: estimating click propensities and using them to train unbiased models. Most common methods proposed in the literature for estimating propensities involve some degree of intervention in the live search engine. An alternative approach proposed

enchant posted on 2025-3-24 08:28:18

BM25 Beyond Query-Document Similarity
… term co-occurrence statistics has been widely applied in an attempt to improve retrieval effectiveness. However, selecting good expansion terms using co-occurrence graphs is challenging. In this paper, we present an adapted version of the BM25 model, which allows measuring the similarity between terms

Reclaim posted on 2025-3-24 12:52:47

Network-Based Pooling for Topic Modeling on Microblog Content
… prior to training) has been shown to improve model outputs, but performance varies depending on the pooling scheme and data set used. Here we investigate a new tweet-pooling method based on network structures associated with Twitter content. Using a standard formulation of the well-known Latent Diri

清洗 posted on 2025-3-24 17:34:18

Bounds and Estimates on the Average Edit Distance
… . denote the average edit distance between random, independent strings of . characters from an alphabet of a given size .. An open problem is the exact value of .. While it is known that, for increasing ., . approaches a limit ., the exact value of this limit is unknown, for any .. This paper prese
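The quantity described in this abstract can be approximated empirically. The snippet below is only a rough Monte Carlo sketch and not the paper's method: it samples pairs of uniformly random strings and averages their edit distance, computed with the standard dynamic-programming recurrence. The names `edit_distance`, `estimate_average_edit_distance`, `n`, `alphabet_size`, and `trials` are illustrative assumptions, since the original symbols were lost from the excerpt.

```python
import random


def edit_distance(a, b):
    """Levenshtein distance between two sequences (unit-cost insert, delete, substitute)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # deleting the first i items of a
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # match or substitute
            ))
        prev = curr
    return prev[-1]


def estimate_average_edit_distance(n, alphabet_size, trials=200, seed=0):
    """Monte Carlo estimate over pairs of independent uniform random strings of length n."""
    rng = random.Random(seed)  # fixed seed so runs are reproducible
    symbols = range(alphabet_size)
    total = 0
    for _ in range(trials):
        x = rng.choices(symbols, k=n)
        y = rng.choices(symbols, k=n)
        total += edit_distance(x, y)
    return total / trials
```

Dividing the estimate by `n` for growing `n` gives a crude view of the limiting ratio the abstract mentions, although sampling error and finite-length effects make this far weaker than a proven bound.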

Nucleate posted on 2025-3-24 21:05:57

Compact Data Structures for Shortest Unique Substring Queries
… (b) . contains the interval [., .] (i.e. .), and (c) every substring . of . with . containing [., .] occurs at least twice in .. Given a query interval ., the . is to output all the SUSs for the interval [., .]. In this article, we propose a . bits data structure answering an interval SUS query in
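To make the interval SUS definition concrete, here is a naive brute-force sketch. It is not the compact data structure the article proposes; it assumes the usual reading of the definition (a substring that occurs exactly once, covers the query interval, and has no shorter unique substring covering the same interval). The function names and the 0-based inclusive indexing are assumptions made for illustration.

```python
def count_occurrences(text, pattern):
    """Count occurrences of pattern in text, including overlapping ones."""
    return sum(
        1
        for k in range(len(text) - len(pattern) + 1)
        if text.startswith(pattern, k)
    )


def interval_sus(text, s, t):
    """Return all shortest unique substrings covering [s, t] as (i, j) pairs.

    Indices are 0-based and inclusive; assumes 0 <= s <= t < len(text).
    """
    n = len(text)
    # Try candidate lengths from the interval length upward; the first
    # length with any unique candidate gives all the answers.
    for length in range(t - s + 1, n + 1):
        hits = []
        first = max(0, t - length + 1)  # smallest start that still reaches t
        last = min(s, n - length)       # largest start that covers s and fits
        for i in range(first, last + 1):
            j = i + length - 1
            if count_occurrences(text, text[i:j + 1]) == 1:
                hits.append((i, j))
        if hits:
            return hits
    return []
```

For example, `interval_sus("aab", 1, 1)` returns `[(0, 1), (1, 2)]`, because `"a"` occurs twice while both `"aa"` and `"ab"` occur once. The sketch takes polynomial time per query, which is exactly the cost a dedicated data structure is meant to avoid.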

发怨言 posted on 2025-3-25 00:06:04

http://reply.papertrans.cn/88/8798/879720/879720_20.png
View full version: Titlebook: String Processing and Information Retrieval; 26th International S Nieves R. Brisaboa, Simon J. Puglisi Conference proceedings 2019 Springer