Special Column-High Performance Computering

Sparse matrix compressed storage format based on GPU

DOI:10.16157/j.issn.0258-7998.245825

Author:Chen Minhao,Bian Haodong

Author Affilications:School of Computer Technology and Application, Qinghai University

Abstract:Sparse Matrix-Vector Multiplication (SpMV) is an important linear algebraic subroutine in Matrix numerical computation. Vectorized Compressed Sparse Row (VCSR) sparse matrix compression format is proposed by studying the load balancing and memory access frequency of SpMV algorithm. This format adjusts the data load of each thread according to the statistical characteristics of the distribution of each line of non-zero elements to prevent the problem of thread divergence, and improves the computational performance of SpMV flow based on the strategy of fast segmented summation and the vectorization method. By using the Sparse matrix of the University of Florida as the test set, the performance of the GPU is tested, and the average performance improvement is 10% to 30%, and the maximum performance is 50% compared to the CSR5 (Compressed Sparse Row 5) format.
Key word:
SpMV
load balancing
storage format
segmented sum methods
floating-point calculation
vectorization
GPU

Parallel quantum simulation method based on circuit cutting approach

DOI:10.16157/j.issn.0258-7998.245854

Author:Zhou Yukai1,Peng Shixin1,Yan Jun2,Jiang Jinhu1

Author Affilications:1.Institute of Big Data, Fudan University; 2.Teaching and Support Center, Information Engineering University

Abstract:Quantum computing has shown great potential in addressing traditional computational challenges, but due to its high error rates and noise issues, classical simulation has become an essential tool for verifying its performance. However, the superposition and entanglement properties of quantum systems pose significant challenges for simulation, especially when memory is limited. Although circuit cutting methods can decompose large-scale quantum circuits into smaller computational tasks to reduce computational load, previous research primarily focused on their application to quantum computers, without fully considering their effectiveness in quantum circuit simulation. This study fills that gap by proposing an optimization scheme based on a heuristic cutting algorithm and subcircuit state vector reuse to address memory limitations in simulations. By incorporating global computational cost considerations and an integer programming model, the heuristic method proposed in this paper not only optimizes the cutting process but also combines subcircuit state vector reuse to reduce redundant calculations and memory usage. Experimental results show that compared to current popular circuit cutting methods, the proposed approach significantly improves simulation speed while reducing memory requirements, effectively addressing the challenges in quantum circuit simulation. The overall average speedup achieved 46%.
Key word:
quantum computing
quantum simulator
quantum circuit
circuit cutting

Gene heuristic multi sequence alignment algorithm based on Yarn cloud platform

DOI:10.16157/j.issn.0258-7998.245448

Author:Yang Bo1,Xu Shengchao1,Zhou Jipeng2,Wang Zhijian1

Author Affilications:1.School of Artificial Intelligent, Guangzhou Huashang College; 2.School of Information Science and Technology, Jinan University

Abstract:This paper proposes a gene heuristic multi sequence alignment algorithm based on the Yarn cloud platform. Establish a nucleic acid replacement equivalence matrix as a genetic heuristic mathematical model, construct the Yarn cloud platform logical architecture, and classify and save the data through steps such as gene data preprocessing, gene data storage, gene data alignment, gene data management, and gene data analysis. Divide long sequences with high error rates, and obtain multiple shorter gene fragments. Implementing localization on different fragments, generating variable length seeds, constructing skeletons and filling gaps, can achieve gene heuristic multi sequence alignment. The results show that the designed algorithm reduces processing time on different datasets, and the sum of pairs (SP) score for multi sequence alignment is higher. This experiment verifies the practicality of the multi sequence alignment method.
Key word:
biological data
parallel computing
distributed computing architecture
distributed database system
big data processing platform

Application and optimization of sparse matrix vector multiplication on C66x

DOI:10.16157/j.issn.0258-7998.244858

Author:Huang Xudong,Hong Ze,Chen Zhenjiao

Author Affilications:China Electronics Technology Group Corporation No.58 Research Institute

Abstract:With the explosive development of big data, sparse matrix has become an important part of machine learning and edge computing. In the field of machine learning, sparse matrix of data sets can not only save information but also save memory, which has become an inevitable trend. Sparse matrix vector multiplication (SpMV) is the core of sparse matrix computation. The space complexity and time complexity of its iterative solution process have important research significance. Analyze the compression format of sparse matrix C00, CSR, ELLPACK and DIA, change the sparsity of sparse matrix and the distribution of non-zero elements, and conclude that the SpMV read by COO and calculated by CSR is more universal. Utilizing the VLIW instruction architecture of C66x, using software pipelining to manage SpMV_CSR algorithm for instruction parallel optimization, utilizing SIMD single instruction multiple data instruction set for SpMV_CSR algorithm completes data parallel optimization. The experimental results indicate that the optimized SpMV_CSR algorithm has an average acceleration ratio of over 5 times compared to before optimization.
Key word:
sparse matrix
SpMV
CSR
C66x
software pipelining
SIMD

Artificial Intelligence

Event reconstruction algorithm based on Transformer residual network

DOI:10.16157/j.issn.0258-7998.245292

Author:Wang Lixi1,Liu Yunping1,Tang Qinqin2,Li Jiahao1

Author Affilications:(1.School of Automation, Nanjing University of Information Science & Technology; 2.School of Rail Transportation, Wuxi University

Abstract:Current artificial visual systems still struggle to handle real-world scenarios involving high-speed motion and high dynamic range scenes. Event cameras have the capability to address these challenges due to their low latency and high dynamic range for capturing fast-moving objects. However, reconstructing events into videos while maintaining their speed presents a challenge due to the highly sparse and dynamic nature of event data. Therefore, this paper proposes an event stream reconstruction algorithm based on Transformer residual networks and optical flow estimation. By jointly training optical flow estimation and event reconstruction, a self-supervised reconstruction process has been achieved. Additionally, deblurring preprocessing and subpixel upsampling modules are introduced to enhance the quality of reconstruction. Experimental results demonstrate that the proposed approach effectively improves the reconstruction quality of event streams on public datasets.
Key word:
event camera
video reconstruction
deep learning
optical flow estimation

Load prediction and optimization of renewable energy access to the distribution network

DOI:10.16157/j.issn.0258-7998.245284

Author:Zhai Zhe1,Yu Jiewen2,Du Yang3,Cao Zejiang4

Author Affilications:1.Dispatching and Control Center, China Southern Power Grid; 2.China Southern Power Grid Artificial Intelligence Technology Co., Ltd.; 3.Shenzhen Faben Information Technology Co., Ltd.; 4.China Southern Power Grid Digital Power Grid Technology (Guangdong) Co., Ltd.

Abstract:Currently, with the large-scale integration of renewable energy into the distribution network, the intermittency and randomness of renewable energy sources such as solar and wind power inevitably cause fluctuations in the distribution network. Considering the characteristics of renewable energy generation power and electricity load in the power grid over time, a load prediction and optimization method based on wavelet transform and neural network for renewable energy access to the distribution network is proposed. Firstly, the grid operation data are collected, and the wavelet transform is used to process the collected data to obtain the feature parameters of local scale and frequency decomposition. A neural network is established. Then the feature parameters obtained after the wavelet transform are trained to obtain a model capable of predicting the load, according to which the power generation of renewable energy sources can be adjusted in time to maintain the dynamic balance between the supply and demand sides of the distribution network. The results show that the proposed method can effectively predict the load and regulate the power generation by observing the load in advance to ensure the stability of power consumption in the distribution network and simultaneously maximize the use of renewable energy.
Key word:
cloud technology
neural network
wavelet transform
wind and solar power generation
load prediction
power generation optimization

Research on face recognition based on improved AlexNet convolutional neural network

DOI:10.16157/j.issn.0258-7998.245086

Author:Cai Jing,Gu Chengrui,Liu Guangda,Sun Huihui

Author Affilications:College of Instrumentation & Electrical Engineering, Jilin University

Abstract:In recent years, face recognition has been a hot topic in society. Compared to contact-based recognition methods such as fingerprint recognition, face recognition offers the advantage of being contactless. In the field of deep learning, traditional convolutional neural networks do not achieve high enough accuracy or speed for face recognition. Therefore, this paper proposes a face recognition algorithm using the AlexNet convolutional neural network. Experimental results show that AlexNet provides higher accuracy and more stability in face recognition compared to traditional convolutional neural networks.
Key word:
deep learning
convolutional neural network
face recognition
AlexNet

Microelectronic Technology

Design and implementation of a secure bus bridge based on ARM architecture

DOI:10.16157/j.issn.0258-7998.245185

Author:Liu Mengying,Wang Fenfen,Feng Haiying

Author Affilications:China Key System & Integrated Circuit Co., Ltd.

Abstract:To improve the security performance of SoC systems, this paper presents a design of a secure bus bridge circuit based on the ARM architecture. The circuit consists of a bridge module and an access control function module. Simulation results indicate that the circuit is capable of performing protocol conversion from AHB to APB under arbitrary multiples of AHB and APB clock frequencies. It also supports flexible configuration of access permissions and protection modes for master and slave devices. This prevents unauthorized access and malicious tampering of peripheral register data, thereby improving the security performance and applicability of the SoC system.
Key word:
ARM
AHB
APB
security
access-control

Analysis of the mechanism and preventive measures of self-excited oscillation of amplifier in a receiver

DOI:10.16157/j.issn.0258-7998.245673

Author:Yu Xiaohui,Ma Zelong

Author Affilications:The 13th Research Institute of China Electronics Technology Group Corporation

Abstract:The fault of higher level of IF-Noise was successfully handled for a receiver. The result shows that individual differences exist within the RF module cavity. It causes the self-excited oscillation of the local oscillator amplifier in pulsed mode. Preventive measures are given after analysis of the mechanism.Attach wave-absorbing materials to the sidewalls of the RF box can effectively eliminate self-excited oscillation.This article has value for the analysis and treatment of cavity self-excited oscillation in the future.
Key word:
amplifier
mechanism of self-excited oscillation
wave-absorbing materials
preventive measures

Measurement Control Technology and Instruments

Research on anti-magnetic interference algorithm of Hall current sensor with circular array

DOI:10.16157/j.issn.0258-7998.245475

Author:Quan Shuo1,Tang Yue1,2,Chu Ziyang1,Shen Yue1

Author Affilications:1.School of Automation, Nanjing University of Information Science and Technology; 2.School of Internet of Things Engineering, Wuxi University

Abstract:Hall array current sensor is a research hotspot in the field of power electronics, especially in the measurement of direct current. It has the advantages of high linearity, large dynamic range, small size and low power consumption, which is an improvement of current sensor with ferromagnetic core. However, due to the absence of core poly-magnetic, in automotive electronics, distribution systems and other places, the conductors are densely arranged, and the external magnetic field is easy to enter the magnetic sensor, thus affecting the measurement accuracy. Therefore, this paper proposes an antimagnetic interference algorithm to solve this problem, using the method of constructing transcendental equations, which can calculate not only the measured current, but also the interference current at any position. The method is verified by numerical simulation, finite element analysis and laboratory experiments. The experimental results show that the error is less than 3.5% under the influence of strong interference current, which verifies the effectiveness of the circular sensor array and anti-magnetic interference algorithm.
Key word:
circular array
Hall current sensor
current measurement
anti-magnetic interference algorithm

Partial discharge fault diagnosis based on residual attention adaptive denoising network and stacking ensemble learning

DOI:10.16157/j.issn.0258-7998.245495

Author:Liao Xiaoqing1,Chen Li1,Xu Jianyuan1,Jin Baoquan1,Jiang Zichao1,Liu Junfeng2

Author Affilications:1.Maoming Power Supply Bureau of Guangdong Power Grid Co., Ltd.; 2.School of Automation Science and Engineering, South China University of Technology

Abstract:To overcome the challenges posed by traditional Partial Discharge (PD) fault diagnosis methods, such as their inability to effectively process complex, noisy PD signals and their reliance on manual denoising and expert input, and the difficulty on learning diverse PD feature representations, this paper introduces two advanced solutions: Residual Attention Adaptive Denoising Network (RAADNet) and Stacking ensemble-based PD fault diagnosis model. RAADNet leverages a residual network framework integrated with a Channel Attention Module (CAM) and a soft thresholding function for adaptive noise reduction. The Stacking ensemble model comprises distinct base learners, including the RAADNet with convolutional neural network architecture, a Transformer featuring multi-head self-attention, and an XGBoost model that adopts a Boosting strategy. Experimental findings reveal that RAADNet surpasses competing advanced techniques, achieving an accuracy of 93.99%. The Stacking model further improves performance by leveraging diverse feature representations, reaching an accuracy of 96.79%.
Key word:
gas insulated switchgear
partial discharge
Stacking ensemble learning
convolutional neural network
Transformer

Communication and Network

Design on PSFP method for TSN switch

DOI:10.16157/j.issn.0258-7998.245540

Author:Shi Huanhuan,Feng Mumu

Author Affilications:China Electronics Technology Group Corporation No.58 Research Institute

Abstract:Time Sensitive Network (TSN) combines the flexibility of traditional Ethernet “Best Effort” transmission methods with the real-time and deterministic characteristics of time sensitive networks, and has the ability to support different types of applications. The Per-Stream Filtering and Policing (PSFP) protocol of TSN can prevent illegal packet attacks and traffic overload by providing certain strategies. Based on the working principle of the protocol, the functions of stream filters, stream gates and flow metering are studied, and an implementation method of PSFP is proposed. Besides, the network topology is designed and verified through experiments on TSN switches. The results show that this design can successfully match abnormal packets that meet the rules, and block or limit the bandwidth of them according to requirements, which can prove the effectiveness of the design. This method is not limited to TSN implemented by hardware or software, and has certain reference value for the application of TSN in sectors such as industry and vehicle.
Key word:
time sensitive network
per-stream filtering and policing
stream filters
stream gates
flow metering

Smart grid FDI detection based on autoencoder and random tree

DOI:10.16157/j.issn.0258-7998.245197

Author:Jing Feng

Author Affilications:State Grid Corporation of Shanxi Electric Power Company Information Communication Branch

Abstract:To cope with new types of cyber attacks (e.g. false data injection attacks) that may be applied to smart grid systems, a machine learning-based intrusion detection method is proposed. The method employs an autoencoder for data dimensionality reduction and uses an extreme random tree classifier to detect potential attacks. The performance of the method is tested under different system sizes and attack levels based on IEEE standard power system data. The experimental results show that in the IEEE 118-node system, the detection accuracy of the method is as high as 99.76%, and the F1 score reaches 99.77% even when only 0.1% of the attack measurements are available, which is much higher than other algorithms. This method is not only effective in detecting intrusions in smart grids, but also has high computational efficiency.
Key word:
attack detection
autoencoder
cyber attack
extreme random tree
spurious data injection
smart grid

Computer Technology and Its Applications

ßFA: a high-performance data processing algorithm based on vector instruction set

DOI:10.16157/j.issn.0258-7998.245114

Author:Yang Jiajia,Guan Jian,Li Zheng,Yu Zengming,Yao Wangjun

Author Affilications:The Sixth Research Institute of China Electronics Corporation

Abstract:Regular expression matching technology plays a significant role in data processing tasks such as data cleaning, parsing, and extraction. However, due to issues such as strong data dependency and unpredictable memory access in the matching process, the matching performance is relatively low. In response to this problem, this paper proposes a high-performance regular expression data processing algorithm based on vector instruction set, which is called ßFA. By using vector instructions to read a sequence of consecutive characters at once, and performing vector matching with the non-trusted character set corresponding to the most frequently accessed state, built-in functions can be utilized to find the position of the first non-trusted character, thus obtaining the number of characters that can be skipped directly, thereby accelerating the matching performance. Experimental results show that the throughput of the ßFA algorithm is superior to the original DFA algorithm and the αFA algorithm, being 4.67~60 times faster than the original DFA algorithm and 4.37~7.82 times faster than the αFA algorithm.
Key word:
regular expression matching
vector instruction set
high-performance data processing

Research on rice pest and disease classification based on federated learning under smart agriculture

DOI:10.16157/j.issn.0258-7998.245131

Author:Huang Jiongjiong

Author Affilications:School of Mathematics and Computer Science, Zhejiang A&F University

Abstract:In the ongoing process of agricultural development, the health of crops continues to be a pivotal research area. Addressing this issue, this paper endeavors to delve into the classification of rice diseases within the framework of intelligent agricultural planting. Within the context of intelligent agricultural planting, this paper advocates the adoption of federated learning as a means to enhance the accuracy of disease classification equipment while safeguarding the data privacy of individual devices, thereby addressing the data silo problem among these devices. For the experimental phase, seven pre-trained models are meticulously selected to extract pertinent features, and four evaluation metrics—accuracy, recall, loss function, and F1-score—are employed to assess the performance of these models. The experimental outcomes reveal that the VGG19 model achieved remarkable accuracy levels of 99.05% and 98.48% on Independent and Identically Distributed (IID) and Non-Independent and Identically Distributed (Non-IID) data sets, respectively, showcasing its robustness and precision. Through a series of experiments and comparative analyses of various indicators, it is conclusively established that the integration of federated learning has enhanced the accuracy of the equipment by a noteworthy margin of 4.36%. Furthermore, the convergence time of the image classification model is influenced by a combination of factors, including the number of federated learning rounds and the training epochs per round within the training set. Notably, the stability of the model improves as the number of devices participating in federated learning increases.
Key word:
smart agriculture
federated learning
image classification
pre-trained models

Security enhancement framework for edge computing mobile access networks based on blockchain and neuro-fuzzy systems

DOI:10.16157/j.issn.0258-7998.244934

Author:Zhang Zijian,Zhang Jing,Xu Daolei,Tang Yixuan

Author Affilications:Information and Communication Branch of State Grid Anhui Electric Power Co., Ltd.

Abstract:In order to improve the security of Internet of Vehicles(IoV), a network security enhancement framework for mobile edge computing based on neuro-fuzzy system and blockchain is proposed. The framework combines technologies such as bulk authentication and key exchange, combines blockchain with mobile edge computing, and is architecturally divided into three layers: perception layer, edge computing layer, and service layer.The perception layer ensures data security through blockchain, the edge computing layer provides resources, and the service layer protects the data using blockchain technology. Simulation shows that combining blockchain and mobile edge computing can effectively improve the security of IoV.
Key word:
blockchain
VANET
neuro-fuzzy systems
edge computing
6G

Circuits and Systems

Design and test of SiP circuit of electromechanical control based on advanced technology

DOI:10.16157/j.issn.0258-7998.245167

Author:Xu Wenyun,Zhang Ming,Zheng Lihua

Author Affilications:No.58 Research Institute, China Electronics Technology Group Corporation

Abstract:Electromechanical control system contains many kinds of chips, and its miniaturization and lightweight demand is increasingly urgent. As an advanced packaging method, system level packaging (System in Package)technology can integrate many different types of chips in a smaller space. In this paper, a SiP circuit of electromechanical control is designed based on system-level packaging technology, combined with TSV and FanOUT technology. The circuit includes the DSP signal control unit and the bottom FPGA signal processing unit, which are stacked in the form of PoP to form a SiP circuit. Compared with the electromechanical control system built by conventional discrete devices, the SiP volume is reduced by more than 70% and the weight is reduced by more than 80%. A test system is designed for SiP circuit, and a loopback test method is proposed for ADC and DAC chips, which can improve the test efficiency. The results show that the circuit meets the design requirements and has a certain application prospect in the field of electromechanical control.
Key word:
electromechanical control
system level packaging
package on package
high integrate

Design of a wide input range and high efficiency space secondary converter

DOI:10.16157/j.issn.0258-7998.245193

Author:Li Shuanggang,Xu Huan,Cao Jiajun

Author Affilications:Shanghai Institute of Space Power-sources

Abstract:This article proposes a design method for a wide input voltage and high efficiency 100 W space secondary power supply. The input voltage range reaches 20 V~50 V, which can cover the voltage range of 28 V and 42 V buses on the satellite. The peak efficiency reaches over 90%, with the characteristics of small size and high power density. Adopting a single ended flyback circuit topology and incorporating slope compensation technology, the duty cycle of the flyback converter operates at over 60%, expanding the input voltage range. Simultaneously using planar transformer technology to reduce transformer leakage inductance, improve efficiency, and reduce size.
Key word:
secondary power supply
flyback
slope compensation
planar transformer

Research on the positioning method of intermodulation interference sources in satellite communication based on frequency difference

DOI:10.16157/j.issn.0258-7998.245134

Author:Li Hejin,Wang Zesen,Yang Lingbo,Pan Lei

Author Affilications:63751 Troops of the Chinese People's Liberation Army

Abstract:In this paper, the location problem of intermodulation interference in satellite communication system is studied. The mechanism of intermodulation interference is analyzed, and the location method of intermodulation interference is given. Based on the quantitative analysis of intermodulation interference spectrum characteristics and the frequency allocation of each station by spectrum control station, a dynamic database of each station frequency and frequency difference in the system is established. An intermodulation interference source location method based on frequency difference is designed to accurately locate the interference stations. The simulation results verify the feasibility of the proposed scheme in the intermodulation interference positioning process, effectively reduce the time of disturbance positioning and improve the probability of accurate positioning of the disturbance station, which provides a solution for the rapid positioning of the satellite communication system in the face of such interference, and has great practical significance for the stable operation of the satellite communication network.
Key word:
satellite communication
intermodulation interference
interference source location
frequency difference

High Performance Computering

Information Flow and Energy Flow in Industrial Digital Transformation

Special Antenna and Radio Frequency Front End

Radar Target Tracking Technology

Key Technologies of 5G-A and 6G

Key Technologies of 5G and Its Evolution

Key Technologies of 5G and Its Evolution

Processing and Application of Marine Target Characteristic Data

Smart Power

Antenna Technology and Its Applications

5G-Advanced and 6G

Smart Agriculture

5G Vertical Industry Application

Microelectronics in Medical and Healthcare

Key Technologies for 6G

Application of Edge Computing in IIoT

Deep Learning and Image Recognization

6G Microwave Millimeter-wave Technology

Radar Processing Technology and Evaluation

Space-Ground Integrated Technology

Industrial Ethernet Network

5G Vertical Industry Application

FPGA and Artificial Intelligence

Innovation and Application of PKS System

5G Network Construction and Optimization

RF and Microwave

Edge Computing

Network and Business Requirements for 6G

5G and Intelligent Transportation

5G R16 Core Network Evolution Technology

Satellite Nevigation Technology

5G R16 Evolution Technology

5G Wireless Network Evolution Technology

5G Network Planning Technology

5G Indoor Coverage Technology

5G MEC and Its Applications

5G Co-construction and Sharing Technology

Expert Forum

5G and Emergency Communication

5G Slicing Technology and Its Applications

Industrial Internet

5G Terminal Key Realization Technology

5G and Artificial Intelligence

5G and Internet of Vehicles

Terahertz Technology and Its Application

Signal and Information Processing

Artificial Intelligence

5G Communication

Internet of Things and the Industrial Big Data

Electronic Techniques of UAV System

Power Electronic Technology

Medical Electronics

Aerospace Electronic Technology

Robot and Industrial Automation

ADAS Technique and Its Implementation

Heterogeneous Computing

2016 IEEE International Conference on Integrated Circuits and Microsystems

ARINC859 Bus Technology

FC Network Technology

Measurement and Control Technology of Bus Network

GJB288A Bus

Key Techniques of 5G and Algorthm Implement

IEEE-1394 Bus

Signal Conditioning Technology of Sensors

AFDX Network Technology

Discrete Signal Processing

Energy-Efficient Computing

Motor control

2012 Altera Electronic Design Article Contest