Showing 541 - 560 results of 588 for search '"CUDA"', query time: 0.08s Refine Results
  1. 541

    Automatic detection and counting of stacked eucalypt timber using the YOLOv8 model by Casas, Gianmarco Goycochea, Ismail, Zool Hilmi, Limeira, Mathaus Messias Coimbra, Lopes da Silva, Antonilmar Araújo, Leite, Helio Garcia

    Published 2023
    “…The model was trained using an AdamW optimizer and implemented using Ultralytics YOLOv8.0.137, Python-3.10.12, and torch-2.0.1 + cu118 with CUDA support on NVIDIA T1000 (4096MiB). For model evaluation, the precision, recall, and mean Average Precision at a 50% confidence threshold (mAP50) were calculated. …”
    Get full text
    Article
  2. 542

    Study of the weakly-compressible SPH method for improving pressure distribution of violent fluid-structure impact flows by Junling He, Qingzhi Hou, Yuejin Cai, Chen Shaokang, Gao Ruixue

    Published 2022-06-01
    “…In this paper, to enhance computational efficiency, the SPH method is implemented on a General Processing Unit (GPU) platform using Compute Unified Device Architecture (CUDA). Parallelized SPH programs including the standard SPH method, Riemann-based SPH and Delta-SPH are verified by a dam break model with large Reynolds number and violent deformation of free surfaces. …”
    Get full text
    Article
  3. 543

    Fast dose calculation of convolution/superposition in radiotherapy based on multi GPU heterogeneous model by LAI Jialu, SONG Ying, ZHOU Li, BAI Xue, HOU Qing

    Published 2021-12-01
    “…High-end GPU, i.e., Tesla C2015, was used for experimental test of CS algorithm executing under the compute unified device architecture (CUDA) platform. Speeds of different number GPUs combined with CPU were compared to find the suitable solution.ResultsThe experimental results show that the speedup of CS algorithm is not completely linear with the number of GPUs. …”
    Get full text
    Article
  4. 544

    Virtual Clinical Trials in 2D and 3D X-ray Breast Imaging and Dosimetry: Comparison of CPU-Based and GPU-Based Monte Carlo Codes by Giovanni Mettivier, Antonio Sarno, Youfang Lai, Bruno Golosio, Viviana Fanti, Maria Elena Italiano, Xun Jia, Paolo Russo

    Published 2022-02-01
    “…We compared three different platforms for in-silico X-ray 3D breast imaging: the Agata (University & INFN Napoli) that was based on the Geant4 toolkit and running on a CPU-based server architecture; the XRMC Monte Carlo (University of Cagliari) that was based on the use of variance reduction techniques, running on a CPU hardware; and the Monte Carlo code gCTD (University of Texas Southwestern Medical Center) running on a single GPU platform with CUDA environment. The tests simulated the irradiation of cylindrical objects as well as anthropomorphic breast phantoms and produced 2D and 3D images and 3D maps of absorbed dose. …”
    Get full text
    Article
  5. 545

    WBIN-Tree: A Single Scan Based Complete, Compact and Abstract Tree for Discovering Rare and Frequent Itemset Using Parallel Technique by Shwetha Rai, Preetham Kumar, K. Nakul Shetty, M. Geetha, B. Giridhar

    Published 2024-01-01
    “…In this paper, a novel Weighted Binary Count Tree (WBIN-Tree) is proposed and implemented in CUDA to exploit the power of GPU and discover rules with rare antecedent and frequent consequent using parallel approach. …”
    Get full text
    Article
  6. 546

    A Flexible and General-Purpose Platform for Heterogeneous Computing by Jose Juan Garcia-Hernandez, Miguel Morales-Sandoval, Erick Elizondo-Rodríguez

    Published 2023-05-01
    “…Accelerating an algorithm for a specific device under a specific framework, i.e., CUDA/GPU, provides a solution with the highest possible performance at the cost of a loss in generality and requires an experienced programmer. …”
    Get full text
    Article
  7. 547

    FOCUS: fast Monte Carlo approach to coherence of undulator sources by M. Siano, G. Geloni, B. Paroli, D. Butti, T. Lefèvre, S. Mazzoni, G. Trad, U. Iriso, A. A. Nosych, L. Torino, M. A. C. Potenza

    Published 2023-01-01
    “…The core structure of the code, which is written in the language C++ accelerated with CUDA, combines an analytical description of the emitted electric fields and massively parallel computations on GPUs. …”
    Get full text
    Article
  8. 548

    Deep neural network-based physical distancing monitoring system with tensorRT optimization by Edi Kurniawan, Hendra Adinanta, Suryadi Suryadi, Bernadus Herdi Sirenden, Rini Khamimatul Ula, Hari Pratomo, Purwowibowo Purwowibowo, Jalu Ahmad Prakosa

    Published 2022-07-01
    “…The optimization process is based on TensorRT executed on Graphical Processing Unit (GPU) and Computer Unified Device Architecture (CUDA) platform. This research evaluates the inferencing speed of the well-known object detection model You-Only-Look-Once (YOLO) run on two different Artificial Intelligence (AI) machines. …”
    Get full text
    Article
  9. 549

    Automatic Detection and Counting of Stacked Eucalypt Timber Using the YOLOv8 Model by Gianmarco Goycochea Casas, Zool Hilmi Ismail, Mathaus Messias Coimbra Limeira, Antonilmar Araújo Lopes da Silva, Helio Garcia Leite

    Published 2023-12-01
    “…The model was trained using an AdamW optimizer and implemented using Ultralytics YOLOv8.0.137, Python-3.10.12, and torch-2.0.1 + cu118 with CUDA support on NVIDIA T1000 (4096MiB). For model evaluation, the precision, recall, and mean Average Precision at a 50% confidence threshold (mAP50) were calculated. …”
    Get full text
    Article
  10. 550

    High Performance Graph Data Imputation on Multiple GPUs by Chao Zhou, Tao Zhang

    Published 2021-01-01
    “…In this paper, we propose a scheme to perform the convolutional imputation algorithm with higher time performance on GPUs (Graphics Processing Units) by exploiting multi-core GPUs of CUDA architecture. We propose optimization strategies to achieve coalesced memory access for graph Fourier transform (GFT) computation and improve the utilization of GPU SM resources for singular value decomposition (SVD) computation. …”
    Get full text
    Article
  11. 551

    How the Processing Mode Influences Azure Kinect Body Tracking Results by Linda Büker, Vincent Quinten, Michel Hackbarth, Sandra Hellmers, Rebecca Diekmann, Andreas Hein

    Published 2023-01-01
    “…Euclidean distances of joint positions varied clinically relevantly with up to 87 mm between runs for CUDA and TensorRT; CPU and DirectML had no differences on the same computer. …”
    Get full text
    Article
  12. 552

    Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines by Barnes, Connelly, Adams, Andrew, Paris, Sylvain, Ragan-Kelley, Jonathan Millar, Durand, Fredo, Amarasinghe, Saman P.

    Published 2014
    “…From simple Halide programs written in a few hours, we demonstrate performance up to 5x faster than hand-tuned C, intrinsics, and CUDA implementations optimized by experts over weeks or months, for image processing applications beyond the reach of past automatic compilers.…”
    Get full text
    Get full text
    Get full text
    Article
  13. 553

    Multicore processors and graphics processing unit accelerators for parallel retrieval of aerosol optical depth from satellite data: implementation, performance, and energy efficien... by Liu, Jia, Feld, Dustin, Xue, Yong, Garcke, Jochen, Soddemann, Thomas

    Published 2015
    “…The compute unified device architecture C (CUDA-C) has been used for the GPU implementation for NVIDIA’s graphic cards and open multiprocessing (OpenMP) for thread- parallelism in the multicore implementation. …”
    Get full text
    Article
  14. 554

    High-performance computing and communication models for solving the complex interdisciplinary problems on DPCS by Alias, N., Sahnoun, R., Malyshkin, V.

    Published 2017
    “…The specific methodologies of PC software under consideration include PVM, MPI, LUNA, MDC, OpenMP, CUDA and LINDA integrated with COMSOL and C++/C. …”
    Get full text
    Article
  15. 555
  16. 556

    BrainPy, a flexible, integrative, efficient, and extensible framework for general-purpose brain dynamics programming by Chaoming Wang, Tianqiu Zhang, Xiaoyu Chen, Sichao He, Shangyang Li, Si Wu

    Published 2023-12-01
    “…Models defined in BrainPy can be JIT compiled into binary instructions for various devices, including Central Processing Unit, Graphics Processing Unit, and Tensor Processing Unit, which ensures high-running performance comparable to native C or CUDA. Additionally, BrainPy features an extensible architecture that allows for easy expansion of new infrastructure, utilities, and machine-learning approaches. …”
    Get full text
    Article
  17. 557

    Methanol extract of Black soldier fly (Hermetia illucens) prepupae against Aeromonas and Staphylococcus aureus bacteria in vitro and in silico by Dahliatul Qosimah, Sanarto Santosa, Maftuch Maftuch, Husnul Khotimah, Loeki Enggar Fitri, Aulanni'am Aulanni'am, Lucia Tri Suwanti

    Published 2023-02-01
    “…Molecular docking of the active ingredients (defensin, chitin, and chitosan as well as fatty acids) in BSF was downloaded from the NCBI database and docked with the Hex Cuda 8.0 program, Correlation type parameters Shape + Electro and Grid Dimension 0.6. …”
    Get full text
    Article
  18. 558

    Password Cracking with Brute Force Algorithm and Dictionary Attack Using Parallel Programming by Ibrahim Alkhwaja, Mohammed Albugami, Ali Alkhwaja, Mohammed Alghamdi, Hussam Abahussain, Faisal Alfawaz, Abdullah Almurayh, Nasro Min-Allah

    Published 2023-05-01
    “…The results show that the NVIDIA GeForce GTX 1050 Ti with CUDA is significantly faster than the Intel(R) HD Graphics 630 GPU for cracking passwords, with a speedup of 11.5× and 10.4× for passwords with and without special characters, respectively. …”
    Get full text
    Article
  19. 559

    PADME - a new code for modeling planet georesources formation on heterogeneous computing systems by Viktor Protasov, Igor Kulikov

    Published 2019-05-01
    “…The aim of the study is to develop a new method for modeling planet formation in 3D2V formulation based on two-phase approach, adapted for using in heterogeneous computing systems equipped with graphics accelerators supporting NVIDIA CUDA technology. The methods of the study. Fluids-in-cells method of Belotserkovskii-Davydov, modified with using the Godunov method, is used to model the gas component. …”
    Get full text
    Article
  20. 560

    MVP-Stereo: A Parallel Multi-View Patchmatch Stereo Method with Dilation Matching for Photogrammetric Application by Qingsong Yan, Junhua Kang, Teng Xiao, Haibing Liu, Fei Deng

    Published 2024-03-01
    “…Second, MVP-Stereo leverages multi-scale parallel patchmatch to reconstruct the depth map for each image in a highly efficient manner, which is implemented by CUDA with random initialization, multi-scale parallel spatial propagation, random refinement, and the coarse-to-fine strategy. …”
    Get full text
    Article