-
541
Automatic detection and counting of stacked eucalypt timber using the YOLOv8 model
Published 2023“…The model was trained using an AdamW optimizer and implemented using Ultralytics YOLOv8.0.137, Python-3.10.12, and torch-2.0.1 + cu118 with CUDA support on NVIDIA T1000 (4096MiB). For model evaluation, the precision, recall, and mean Average Precision at a 50% confidence threshold (mAP50) were calculated. …”
Get full text
Article -
542
Study of the weakly-compressible SPH method for improving pressure distribution of violent fluid-structure impact flows
Published 2022-06-01“…In this paper, to enhance computational efficiency, the SPH method is implemented on a General Processing Unit (GPU) platform using Compute Unified Device Architecture (CUDA). Parallelized SPH programs including the standard SPH method, Riemann-based SPH and Delta-SPH are verified by a dam break model with large Reynolds number and violent deformation of free surfaces. …”
Get full text
Article -
543
Fast dose calculation of convolution/superposition in radiotherapy based on multi GPU heterogeneous model
Published 2021-12-01“…High-end GPU, i.e., Tesla C2015, was used for experimental test of CS algorithm executing under the compute unified device architecture (CUDA) platform. Speeds of different number GPUs combined with CPU were compared to find the suitable solution.ResultsThe experimental results show that the speedup of CS algorithm is not completely linear with the number of GPUs. …”
Get full text
Article -
544
Virtual Clinical Trials in 2D and 3D X-ray Breast Imaging and Dosimetry: Comparison of CPU-Based and GPU-Based Monte Carlo Codes
Published 2022-02-01“…We compared three different platforms for in-silico X-ray 3D breast imaging: the Agata (University & INFN Napoli) that was based on the Geant4 toolkit and running on a CPU-based server architecture; the XRMC Monte Carlo (University of Cagliari) that was based on the use of variance reduction techniques, running on a CPU hardware; and the Monte Carlo code gCTD (University of Texas Southwestern Medical Center) running on a single GPU platform with CUDA environment. The tests simulated the irradiation of cylindrical objects as well as anthropomorphic breast phantoms and produced 2D and 3D images and 3D maps of absorbed dose. …”
Get full text
Article -
545
WBIN-Tree: A Single Scan Based Complete, Compact and Abstract Tree for Discovering Rare and Frequent Itemset Using Parallel Technique
Published 2024-01-01“…In this paper, a novel Weighted Binary Count Tree (WBIN-Tree) is proposed and implemented in CUDA to exploit the power of GPU and discover rules with rare antecedent and frequent consequent using parallel approach. …”
Get full text
Article -
546
A Flexible and General-Purpose Platform for Heterogeneous Computing
Published 2023-05-01“…Accelerating an algorithm for a specific device under a specific framework, i.e., CUDA/GPU, provides a solution with the highest possible performance at the cost of a loss in generality and requires an experienced programmer. …”
Get full text
Article -
547
FOCUS: fast Monte Carlo approach to coherence of undulator sources
Published 2023-01-01“…The core structure of the code, which is written in the language C++ accelerated with CUDA, combines an analytical description of the emitted electric fields and massively parallel computations on GPUs. …”
Get full text
Article -
548
Deep neural network-based physical distancing monitoring system with tensorRT optimization
Published 2022-07-01“…The optimization process is based on TensorRT executed on Graphical Processing Unit (GPU) and Computer Unified Device Architecture (CUDA) platform. This research evaluates the inferencing speed of the well-known object detection model You-Only-Look-Once (YOLO) run on two different Artificial Intelligence (AI) machines. …”
Get full text
Article -
549
Automatic Detection and Counting of Stacked Eucalypt Timber Using the YOLOv8 Model
Published 2023-12-01“…The model was trained using an AdamW optimizer and implemented using Ultralytics YOLOv8.0.137, Python-3.10.12, and torch-2.0.1 + cu118 with CUDA support on NVIDIA T1000 (4096MiB). For model evaluation, the precision, recall, and mean Average Precision at a 50% confidence threshold (mAP50) were calculated. …”
Get full text
Article -
550
High Performance Graph Data Imputation on Multiple GPUs
Published 2021-01-01“…In this paper, we propose a scheme to perform the convolutional imputation algorithm with higher time performance on GPUs (Graphics Processing Units) by exploiting multi-core GPUs of CUDA architecture. We propose optimization strategies to achieve coalesced memory access for graph Fourier transform (GFT) computation and improve the utilization of GPU SM resources for singular value decomposition (SVD) computation. …”
Get full text
Article -
551
How the Processing Mode Influences Azure Kinect Body Tracking Results
Published 2023-01-01“…Euclidean distances of joint positions varied clinically relevantly with up to 87 mm between runs for CUDA and TensorRT; CPU and DirectML had no differences on the same computer. …”
Get full text
Article -
552
Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines
Published 2014“…From simple Halide programs written in a few hours, we demonstrate performance up to 5x faster than hand-tuned C, intrinsics, and CUDA implementations optimized by experts over weeks or months, for image processing applications beyond the reach of past automatic compilers.…”
Get full text
Get full text
Get full text
Article -
553
Multicore processors and graphics processing unit accelerators for parallel retrieval of aerosol optical depth from satellite data: implementation, performance, and energy efficien...
Published 2015“…The compute unified device architecture C (CUDA-C) has been used for the GPU implementation for NVIDIA’s graphic cards and open multiprocessing (OpenMP) for thread- parallelism in the multicore implementation. …”
Get full text
Article -
554
High-performance computing and communication models for solving the complex interdisciplinary problems on DPCS
Published 2017“…The specific methodologies of PC software under consideration include PVM, MPI, LUNA, MDC, OpenMP, CUDA and LINDA integrated with COMSOL and C++/C. …”
Get full text
Article -
555
Deep Learning and Parallel Computing Environment for Bioengineering Systems /
Published 2019Get full text
software, multimedia -
556
BrainPy, a flexible, integrative, efficient, and extensible framework for general-purpose brain dynamics programming
Published 2023-12-01“…Models defined in BrainPy can be JIT compiled into binary instructions for various devices, including Central Processing Unit, Graphics Processing Unit, and Tensor Processing Unit, which ensures high-running performance comparable to native C or CUDA. Additionally, BrainPy features an extensible architecture that allows for easy expansion of new infrastructure, utilities, and machine-learning approaches. …”
Get full text
Article -
557
Methanol extract of Black soldier fly (Hermetia illucens) prepupae against Aeromonas and Staphylococcus aureus bacteria in vitro and in silico
Published 2023-02-01“…Molecular docking of the active ingredients (defensin, chitin, and chitosan as well as fatty acids) in BSF was downloaded from the NCBI database and docked with the Hex Cuda 8.0 program, Correlation type parameters Shape + Electro and Grid Dimension 0.6. …”
Get full text
Article -
558
Password Cracking with Brute Force Algorithm and Dictionary Attack Using Parallel Programming
Published 2023-05-01“…The results show that the NVIDIA GeForce GTX 1050 Ti with CUDA is significantly faster than the Intel(R) HD Graphics 630 GPU for cracking passwords, with a speedup of 11.5× and 10.4× for passwords with and without special characters, respectively. …”
Get full text
Article -
559
PADME - a new code for modeling planet georesources formation on heterogeneous computing systems
Published 2019-05-01“…The aim of the study is to develop a new method for modeling planet formation in 3D2V formulation based on two-phase approach, adapted for using in heterogeneous computing systems equipped with graphics accelerators supporting NVIDIA CUDA technology. The methods of the study. Fluids-in-cells method of Belotserkovskii-Davydov, modified with using the Godunov method, is used to model the gas component. …”
Get full text
Article -
560
MVP-Stereo: A Parallel Multi-View Patchmatch Stereo Method with Dilation Matching for Photogrammetric Application
Published 2024-03-01“…Second, MVP-Stereo leverages multi-scale parallel patchmatch to reconstruct the depth map for each image in a highly efficient manner, which is implemented by CUDA with random initialization, multi-scale parallel spatial propagation, random refinement, and the coarse-to-fine strategy. …”
Get full text
Article