Titlebook: Computer Engineering and Technology; 20th CCF Conference, Weixia Xu,Liquan Xiao,Zhenzhen Zhu Conference proceedings 2016 Springer Nature Si

显示全部楼层 · 发表于 2025-3-28 20:18:31

Language-Extension-Based Vectorizing Compiling Scheme on SDR-DSP We use LEVCS to vectorize five benchmark kernels: Fast Fourier Transform (FFT), Finite Impulse Responsefilter (FIR) and Infinite Impulse Response filter (IIR), Dot product implementation (Dotprod), Sum of vectors (vecsum). Experiment results show that LEVCS is functional correct and can achieve 2.883–8.074 speedups comparing to TI-DSPs.

显示全部楼层 · 发表于 2025-3-29 00:26:42

A Dynamic Multi-precision Fixed-Point Data Quantization Strategy for Convolutional Neural Network2% to 5.9% at most, compared with previous static quantization strategy, when 8/4-bit quantization is used. When 16-bit quantization is used, only 0.03% accuracy loss is introduced by our quantization strategy with half memory footprint and bandwidth requirement comparing with 32-bit floating-point implementation.

显示全部楼层 · 发表于 2025-3-29 05:01:12

显示全部楼层 · 发表于 2025-3-29 07:22:03

显示全部楼层 · 发表于 2025-3-29 12:33:50

Monaural Speech Separation on Many Integrated Core Architecturehitecture to meet the requirement of real-time speech separation. This approach conducts parallelism based on the OpenMP technology, and performs the computing intensitive matrix manipulations on a MIC coprocessor. The experimental results confirm the efficiency of our implementation of monaural speech separation on MIC architecture.

显示全部楼层 · 发表于 2025-3-29 18:49:30

Single/Double Precision Floating-Point Division and Square Root Unit Based on SRT-8 Algorithmde the latency of look-up table, generating fast addend was used to decrease critical path, and “On-the-fly” conversion was employed for saving area-cost. Experimental results show that our proposed design can achieve low latency and low hardware overhead.

显示全部楼层 · 发表于 2025-3-29 23:45:04

A Methodology for Performance Verification of Microprocessorstion and RTL simulation based benchmarks are made at the core-level. Prototyping and counter-based performance analysis systems are built in the system level. An example is given to demonstrate the application and effectiveness of the proposed methodology.

显示全部楼层 · 发表于 2025-3-30 02:50:33

显示全部楼层 · 发表于 2025-3-30 05:18:52

A New DVFS Algorithm Design for Multi-core Processor Chiptional single-threshold algorithm, experimental results show that dual-threshold adaptive DVFS can save more power with no obviously performance reduction. The performance of most benchmarks is beyond 90% of the original performance, while the power optimization can be up to 35%.

		自动登录	找回密码
密码			To register

关于派博传思			派博传思旗下网站			友情链接
派博传思介绍	公司地理位置	论文服务流程	影响因子官网	吾爱论文网	大讲堂	北京大学	Oxford Uni.	Harvard Uni.
发展历史沿革	期刊点评	投稿经验总结	SCIENCEGARD	IMPACTFACTOR	派博系数	清华大学	Yale Uni.	Stanford Uni.
\|Archiver\|手机版\|小黑屋\| 派博传思国际 ( 京公网安备110108008328) GMT+8, 2026-2-9 22:54
Copyright © 2001-2015 派博传思京公网安备110108008328 版权所有 All rights reserved

Titlebook: Computer Engineering and Technology; 20th CCF Conference, Weixia Xu,Liquan Xiao,Zhenzhen Zhu Conference proceedings 2016 Springer Nature Si

浏览过的版块