找回密码
 To register

QQ登录

只需一步,快速开始

扫一扫,访问微社区

Titlebook: Computer Engineering and Technology; 20th CCF Conference, Weixia Xu,Liquan Xiao,Zhenzhen Zhu Conference proceedings 2016 Springer Nature Si

[复制链接]
楼主: osteomalacia
发表于 2025-3-28 15:26:35 | 显示全部楼层
发表于 2025-3-28 20:18:31 | 显示全部楼层
Language-Extension-Based Vectorizing Compiling Scheme on SDR-DSP We use LEVCS to vectorize five benchmark kernels: Fast Fourier Transform (FFT), Finite Impulse Responsefilter (FIR) and Infinite Impulse Response filter (IIR), Dot product implementation (Dotprod), Sum of vectors (vecsum). Experiment results show that LEVCS is functional correct and can achieve 2.883–8.074 speedups comparing to TI-DSPs.
发表于 2025-3-29 00:26:42 | 显示全部楼层
A Dynamic Multi-precision Fixed-Point Data Quantization Strategy for Convolutional Neural Network2% to 5.9% at most, compared with previous static quantization strategy, when 8/4-bit quantization is used. When 16-bit quantization is used, only 0.03% accuracy loss is introduced by our quantization strategy with half memory footprint and bandwidth requirement comparing with 32-bit floating-point implementation.
发表于 2025-3-29 05:01:12 | 显示全部楼层
发表于 2025-3-29 07:22:03 | 显示全部楼层
发表于 2025-3-29 12:33:50 | 显示全部楼层
Monaural Speech Separation on Many Integrated Core Architecturehitecture to meet the requirement of real-time speech separation. This approach conducts parallelism based on the OpenMP technology, and performs the computing intensitive matrix manipulations on a MIC coprocessor. The experimental results confirm the efficiency of our implementation of monaural speech separation on MIC architecture.
发表于 2025-3-29 18:49:30 | 显示全部楼层
Single/Double Precision Floating-Point Division and Square Root Unit Based on SRT-8 Algorithmde the latency of look-up table, generating fast addend was used to decrease critical path, and “On-the-fly” conversion was employed for saving area-cost. Experimental results show that our proposed design can achieve low latency and low hardware overhead.
发表于 2025-3-29 23:45:04 | 显示全部楼层
A Methodology for Performance Verification of Microprocessorstion and RTL simulation based benchmarks are made at the core-level. Prototyping and counter-based performance analysis systems are built in the system level. An example is given to demonstrate the application and effectiveness of the proposed methodology.
发表于 2025-3-30 02:50:33 | 显示全部楼层
发表于 2025-3-30 05:18:52 | 显示全部楼层
A New DVFS Algorithm Design for Multi-core Processor Chiptional single-threshold algorithm, experimental results show that dual-threshold adaptive DVFS can save more power with no obviously performance reduction. The performance of most benchmarks is beyond 90% of the original performance, while the power optimization can be up to 35%.
 关于派博传思  派博传思旗下网站  友情链接
派博传思介绍 公司地理位置 论文服务流程 影响因子官网 SITEMAP 大讲堂 北京大学 Oxford Uni. Harvard Uni.
发展历史沿革 期刊点评 投稿经验总结 SCIENCEGARD IMPACTFACTOR 派博系数 清华大学 Yale Uni. Stanford Uni.
|Archiver|手机版|小黑屋| 派博传思国际 ( 京公网安备110108008328) GMT+8, 2025-5-5 00:33
Copyright © 2001-2015 派博传思   京公网安备110108008328 版权所有 All rights reserved
快速回复 返回顶部 返回列表