publications




Selected Publications


2024

In Press


Journal Papers

  • SHREG: Mitigating Register Redundancy in GPUs
  • Seunghyun Jin, Hyunwuk Lee, Jonghyun Lee, Junsung Kim, and Won Woo Ro
  • [SCI-Q1]  

    Journal of Systems Architecture Vol. 145, Mar. 2024  (IF: 4.5, Q1, JCR2022)

Conference Papers

  • Geneva: A Dynamic Confluence of Speculative Execution and In-Order Commitment Windows
  • Yanghee Lee, Jiwon Lee, Jaewon Kwon, Yongju Lee, and Won Woo Ro
  • [Top-Tier]  

    The 61th Design Automation Conference (DAC), 2024   (IF: 3, NRF BK21four, Acceptance Rate: 23%)
  • REPrune: Channel Pruning via Kernel Representative Selection
  • Mincheol Park, Dongjin Kim, Cheonjun Park, Yuna Park, Gyeong Eun Gong, Won Woo Ro, and Suhyun Kim
  • [Top-Tier]  

    The 38th AAAI Conference on Artificial Intelligence (AAAI), 2024   (IF: 4, NRF BK21four, Acceptance Rate: 23.7% [2342/12100])


2023

Journal Papers

  • A Convertible Neural Processor Supporting Adaptive Quantization for Real-Time Neural Networks [Link]
  • Hongju Kal, Hyoseong Choi, Ipoom Jeong, Joon-Sung Yang, and Won Woo Ro
  • [SCI-Q1]  

    Journal of Systems Architecture Vol. 145, Nov. 2023  (IF: 4.5, Q1, JCR2022)

Conference Papers

  • INTERPRET: Inter-Warp Register Reuse for GPU Tensor Cores [Link]
  • Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, and Won Woo Ro
  • [Top-Tier]  

    The 32th International Conference on Parallel Architectures and Compilation Techniques (PACT), 2023   (IF: 3, NRF BK21four)
  • McCore: A Holistic Management of High-Performance Heterogeneous Multicores [Link]
  • Jaewon Kwon, Yongju Lee, Hongju Kal, Minjae Kim, Youngsok Kim, and Won Woo Ro
  • [Top-Tier]  

    The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
  • AESPA: Asynchronous Execution Scheme to Exploit Bank-Level Parallelism of Processing-in-Memory [Link]
  • Hongju Kal, Chanyoung Yoo, and Won Woo Ro
  • [Top-Tier]  

    The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
  • MAD MAcce: Supporting Multiply-Add Operations for Democratizing Matrix-Multiplication Accelerator [Link]
  • Seunghwan Sung, Sujin Hur, Dongho Ha, Sungwoo Kim, Yunho Oh, and Won Woo Ro
  • [Top-Tier]  

    The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
  • Exploiting Inherent Properties of Complex Numbers for Accelerating Complex Valued Neural Networks [Link]
  • Hyunwuk Lee, Hyungjun Jang, Sungbin Kim, Sungwoo Kim, Wonho Cho, and Won Woo Ro
  • [Top-Tier]  

    The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
  • TensorCV: Accelerating Inference-Adjacent Computation Using Tensor Processors [Link]
  • Dongho Ha, Won Woo Ro, and Hung-Wei Tseng
  • The 2023 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), 2023   (IF: 1, NRF BK21four)
  • R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs [Link]
  • Dongho Ha, Yunho Oh, and Won Woo Ro
  • [Top-Tier]  

    The 50th International Symposium on Computer Architecture (ISCA), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 21.2% [79/372])
  • Early-Adaptor: An Adaptive Framework for Proactive UVM Memory Management [Link]
  • Seokjin Go, Hyunwuk Lee, Junsung Kim, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
  • The 2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023   (IF: 1, NRF BK21four, Acceptance Rate: 37.8%)
  • Lightning Talk: Efficiency and Programmability of DNN Accelerators and GPUs [Link]
  • Won Woo Ro
  • [Top-Tier]  

    The 60th Design Automation Conference (DAC), 2023   (IF: 3, NRF BK21four, Acceptance Rate: 23%)
  • Quixote: Improving Fidelity of Quantum Program by Independent Execution of Controlled Gates [Link]
  • Enhyeok Jang, Seungwoo Choi, and Won Woo Ro
  • [Top-Tier]  

    The 60th Design Automation Conference (DAC), 2023   (IF: 3, NRF BK21four, Acceptance Rate: 23%)
  • Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism [Link]
  • Cheonjun Park, Mincheol Park, Hyun Jae Oh, Minkyu Kim, Myung Kuk Yoon, Suhyun Kim, and Won Woo Ro
  • [Top-Tier]  

    The 37th AAAI Conference on Artificial Intelligence (AAAI), 2023   (IF: 4, NRF BK21four, Oral Acceptance Rate: 10.8% [952/8777], Oral Presentation)
  • SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs [Link]
  • Jiwon Lee, Ju Min Lee, Yunho Oh, William J. Song, and Won Woo Ro
  • [Top-Tier]  

    The 29th IEEE International Symposium on High-Performance Computer (HPCA), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 25.0% [91/364])


2022

Journal Papers

  • TEA-RC: Thread Context-Aware Register Cache for GPUs [Link]
  • Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
  • [SCI-Q2]  

    IEEE Access   (IF: 3.476, Q2, JCR2021)
  • CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs [Link]
  • Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
  • [SCI-Q2]  

    IEEE Embedded Systems Letters   (IF: 2.169, Q2, JCR2020)
  • FLIXR: Embedding Index into Flash Translation Layer in SSDs [Link]
  • Gunjae Koo, Yunho Oh, Hung-Wei Tseng, Won Woo Ro, and Murali Annavaram
  • [SCI-Q2]  

    IEEE Transactions on Computers, doi: 10.1109/TC.2022.3154602., Feb. 2022   (IF: 2.663, Q2, JCR2020)

Conference Papers

  • Reconstructing Out-of-Order Issue Queue [Link]
  • Ipoom Jeong, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
  • [Top-Tier]  

    The 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2022   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [83/348])


2021

Journal Papers

  • Two-Stage In-Storage Processing and Scheduling for Pattern Matching Applications [Link]
  • Joohyeong Yoon, Yoonjin Lee, Won Seob Jeong, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Access, Vol. 9, pp. 95702-95715, Jun. 2021   (IF: 3.367, Q1, JCR2020)
  • PIMCaffe: Functional Evaluation of a Machine Learning Framework for In-Memory Neural Processing Unit [Link]
  • Won Jeon, Jiwon Lee, Dongseok Kang, Hongju Kal, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Access, Vol. 9, pp. 96629-96640, Jul. 2021   (IF: 3.367, Q1, JCR2020)

Conference Papers

  • SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations [Link]
  • Hongju Kal, Seokmin Lee, Gun Ko, and Won Woo Ro
  • [Top-Tier]  

    The 48th ACM/IEEE International Symposium on Computer Architecture (ISCA), 2021   (IF: 4, NRF BK21four, Acceptance Rate: 18.7% [76/406])


2020

Journal Papers

  • Hi-End: Hierarchical, Endurance-Aware STT-MRAM-Based Register File for Energy-Efficient GPUs [Link]
  • Won Jeon, Jun Hyun Park, Yoonsoo Kim, Gunjae Koo, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Access, Vol. 8, pp. 127768-127780, Jul. 2020   (IF: 3.745, Q1, JCR2019)
  • REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing [Link]
  • Won Seob Jeong, Changmin Lee, Keunsoo Kim, Myung Kuk Yoon, Won Jeon, Myoungsoo Jung, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Parallel and Distributed Systems, Vol. 31, Issue 5, pp.1137-1151, May 2020   (IF: 3.402, Q1, JCR2018)

Conference Papers

  • Duplo: Lifting Redundant Memory Accesses of Neural Networks for GPU Tensor Cores [Link]
  • Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, and William J. Song
  • [Top-Tier]  

    The 53rd IEEE/ACM International Symposium on Microarchitecture (MICRO), 2020   (IF: 4, NRF BK21four, Acceptance Rate: 19.4% [82/422])
  • Check-In: In-Storage Checkpointing for Key-Value Store System Leveraging Flash-Based SSDs [Link]
  • Joohyeong Yoon, Won Seob Jeong, and Won Woo Ro
  • [Top-Tier]  

    The 47th ACM/IEEE International Symposium on Computer Architecture (ISCA), 2020   (IF: 4, NRF BK21+, Acceptance Rate: 18.2% [77/421])
  • CASINO Core Microarchitecture: Generating Out-of-Order Schedules Using Cascaded In-Order Scheduling Windows [Link]
  • Ipoom Jeong, Seihoon Park, Changmin Lee, and Won Woo Ro
  • [Top-Tier]  

    The 26th IEEE International Symposium on High Performance Computer Architecture (HPCA), 2020   (IF: 4, NRF BK21+, Acceptance Rate: 16.9% [48/284])


2019

Journal Papers

  • OverCome: Coarse-Grained Instruction Commit with Handover Register Renaming [Link]
  • Ipoom Jeong, Changmin Lee, Keunsoo Kim, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Computers, Vol. 68, Issue 12, pp. 1802-1816, Dec. 2019 (IF: 3.131, Q1, JCR2018)
  • Contents-Aware Partitioning Algorithm for Parallel High Efficiency Video Coding [Link]
  • Kyungah Kim and Won Woo Ro
  • Multimedia Tools and Applications, Vol. 78, Issue 9, pp. 11427-11442, May 2019 (IF: 2.101, Q3, JCR2018)
  • Fast CU Depth Decision for HEVC using Neural Networks [Link]
  • Kyungah Kim and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Circuits and Systems for Video Technology, Vol. 29, No. 5, pp. 1462-1473, May 2019 (IF: 4.046, Q1, JCR2018)
  • Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs [Link]
  • Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Computers, Vol. 68, No. 4, pp. 609-616, Apr. 2019 (IF: 3.131, Q1, JCR2018)

Conference Papers

  • Efficient Dilated-Winograd Convolutional Neural Networks [Link]
  • Minsik Kim, Cheonjun Park, Sungjun Kim, Taeyoung Hong, and Won Woo Ro
  • The 2019 IEEE International Conference on Image Processing (ICIP), 2019 Taipei, Taiwan, Sep. 22 - 25(Acceptance Rate: 46.2% [956/2068])
  • Linebacker: Preserving Victim Cache Lines in Idle Register Files of GPUs [Link]
  • Yunho Oh, Gunjae Koo, Murali Annavaram, and Won Woo Ro
  • [Top-Tier]  

    The 46th ACM/IEEE International Symposium on Computer Architecture (ISCA), 2019 Phoenix, Arizona, USA, Jun. 22 - 26 (IF: 4, NRF BK21four, Acceptance Rate:17.0% [62/365])


2018

Journal Papers

  • WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs [Link]
  • Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Computers, Vol. 67, No. 9, pp. 1366-1373, Sep. 2018 (IF: 3.052, Q1, JCR2017)
  • Exploiting Pseudo-Quadtree Structure for Accelerating HEVC Spatial Resolution Downscaling Transcoder [Link]
  • Minsik Kim, Minyong Sung, Minwoo Kim, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Multimedia, Vol. 20, No. 9, pp. 2262-2275, Sep. 2018 (IF: 3.977, Q1, JCR2017)
  • Architectural Protection of Application Privacy against Software and Physical Attacks in Untrusted Cloud Environment [Link]
  • Lei Xu, JongHyuk Lee, Seung Hun Kim, Qingji Zheng, Shouhuai Xu, Taeweon Suh, Won Woo Ro, and Weidong Shi
  • [SCI-Q1]  

    IEEE Transactions on Cloud Computing, Vol. 6, No. 2, pp. 478-491, Apr-Jun. 2018 (IF: 7.928, Q1, JCR2017)
  • Simultaneous and Speculative Thread Migration for Improving Energy Efficiency of Heterogeneous Core Architectures [Link]
  • Changmin Lee and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Computers, Vol. 67, No. 4, pp. 498-512, Apr. 2018 (IF: 3.052, Q1, JCR2017)

Conference Papers

  • FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput [Link]
  • Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
  • The 51st IEEE/ACM International Symposium on Microarchitecture
  • [Top-Tier]  

    (MICRO 2018) Fukuoka, Japan, Oct. 20 - 24, 2018 (IF: 4, NRF BK21+,Acceptance Rate:21.1% [74/351])
  • WIR: Warp Instruction Reuse to Minimize Repeated Computations in GPUs [Link]
  • Keunsoo Kim and Won Woo Ro
  • The 24th IEEE International Symposium on High Performance Computer Architecture
  • [Top-Tier]  

    (HPCA 2018)Wien, Austria, Feb. 24 - 28, 2018 (IF: 4, NRF BK21+, Acceptance Rate:20.8% [54/260])


2017

Journal Papers

  • Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs [Link]
  • Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Parallel and Distributed Systems, Vol. 28, No. 11, pp. 3142-3156, Nov. 2017 (IF: 4.181, Q1, JCR2016)
  • Dynamic Load Balancing of Dispatch Scheduling for Solid State Disks [Link]
  • Myunghyun Jo and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Computers, Vol. 66, No. 6, pp. 1034-1047, Jun. 2017 (IF: 2.916, Q1, JCR2016)
  • Improving Energy Efficiency of GPUs through Data Compression and Compressed Execution [Link]
  • Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Murali Annavaram, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Computers, Vol. 66, No. 5, pp. 834-847, May 2017 (IF: 2.916, Q1, JCR2016)

Conference Papers

  • Access Pattern-Aware Cache Management for Improving Data Utilization in GPU [Link]
  • Gunjae Koo, Yunho Oh, Won Woo Ro, and Murali Annavaram
  • The 44th ACM/IEEE International Symposium on Computer Architecture
  • [Top-Tier]  

    (ISCA 2017) Torronto, Canada, Jun. 24 - 28, 2017 (IF: 4, NRF BK21+, Acceptance Rate:16.8% [54/322])


2016

Journal Papers

  • Server Side, Play Buffer Based Quality Control for Adaptive Media Streaming [Link]
  • Keunsoo Kim, Benjamin Y. Cho, and Won Woo Ro
  • Multimedia Tools and Applications, Vol. 75, No. 10, pp. 5397-5415, May 2016 (IF: 1.331, Q2, JCR2015)
  • Exploiting Thread-Level Parallelism on HEVC by Employing Reference Dependency Graph [Link]
  • Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Circuits and Systems for Video Technology, Vol. 26, No. 4, pp. 736-749, Apr. 2016 (IF: 2.254, Q1, JCR2015)
  • Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction [Link]
  • Sangpil Lee and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Computers, Vol. 65, No. 4, pp. 1253-1265, Apr. 2016 (IF: 1.723, Q1, JCR2015)

Conference Papers

  • Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit [Link]
  • Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • [Top-Tier]  

    (ISCA 2016)Seoul, Korea, Jun. 18 - 22, 2016 (IF: 4, NRF BK21+)
  • APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs [Link]
  • Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • [Top-Tier]  

    (ISCA 2016)Seoul, Korea, Jun. 18 - 22, 2016 (IF: 4, NRF BK21+)
  • Warped-Slicer: Efficient Intra-SM Slicing through Dynamic Resource Partitioning for GPU Multiprogramming [Link]
  • Qiumin Xu, Hyeran Jeon, Keunsoo Kim, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • [Top-Tier]  

    (ISCA 2016) Seoul, Korea, Jun. 18 - 22, 2016 (IF: 4, NRF BK21+)
  • Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding [Link]
  • Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
  • The 22nd IEEE International Symposium on High Performance Computer Architecture
  • [Top-Tier]  

    (HPCA 2016)Barcelona, Spain, Mar. 12 - 16, 2016 (IF: 4, NRF BK21+)


2015

Journal Papers

  • A Performance-Energy Model to Evaluate Single Thread Execution Acceleration [Link]
  • Seung Hun Kim, Dohoon Kim, Changmin Lee, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Computer Architecture Letters, Vol.14, No.2, pp. 99-102, Dec. 2015 (IF: 0.677, Q3, JCR2014)
  • Dynamic Load Balancing of Parallel SURF with Vertical Partitioning [Link]
  • Deokho Kim, Minwoo Kim, Kyungah Kim, Minyong Sung, and Won Woo Ro
  • [SCI-Q1]  

    IEEE Transactions on Parallel and Distributed Systems, Vol. 26, No. 12, pp. 3358-3370, Dec. 2015 (IF: 2.170, Q1, JCR2014)
  • Network Variation and Fault Tolerant Performance Acceleration in Mobile Devices with Simultaneous Remote Execution [Link]
  • Keunsoo Kim, Benjamin Y. Cho, Won Woo Ro, and Jean-Luc Gaudiot
  • [SCI-Q1]  

    IEEE Transactions on Computers, Vol. 64, No. 10, pp. 2862-2874, Oct. 2015 (IF: 1.659, Q1, JCR2014)
  • Highly Secure Mobile Devices Assisted with Trusted Cloud Computing Environments [Link]
  • Doohwan Oh, Ilkyu Kim, Keunsoo Kim, Sang-Min Lee, and Won Woo Ro
  • ETRI Journal, Vol. 37, No. 2, pp. 348-358, Apr. 2015 (IF: 0.771, Q3, JCR2014)

Conference Papers

  • True Motion Compensation With Feature Detection for Frame Rate Up-Conversion [Link]
  • Kyungah Kim, Minwoo Kim, Deokho Kim, and Won Woo Ro
  • The 2015 IEEE International Conference on Image Processing
  • (ICIP 2015) Quebec City, Canada, Sep. 27 - 30, 2015
  • An Accelerated Separable Median Filter with Sorting Networks [Link]
  • Minsik Kim, Deokho Kim, Minyong Sung, and Won Woo Ro
  • The 2015 IEEE International Conference on Image Processing
  • (ICIP 2015) Quebec City, Canada, Sep. 27 - 30, 2015
  • Enhancing Software Dependability and Security with Hardware Supported Instruction Address Space Randomization [Link]
  • Seung Hun Kim, Lei Xu, Ziyi Liu, Zhiqiang Lin, Won Woo Ro, and Weidong Shi
  • The 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks
  • (DSN 2015) Rio de Janerio, Brazil, Jun. 22 - 25, 2015
  • Warped-Compression: Enabling Power Efficient GPUs through Register Compression [Link]
  • Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, and Murali Annavaram
  • The 42nd ACM/IEEE International Symposium on Computer Architecture
  • [Top-Tier]  

    (ISCA 2015) Portland, OR, USA, Jun. 13 - 17, 2015 (IF: 4, NRF BK21+)
  • DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU [Link]
  • Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
  • The 2015 IEEE International Symposium on Performance Analysis of Systems and Software
  • (ISPASS 2015) Philadelphia, PA, USA, Mar. 29 - 31, 2015


2014

Journal Papers

  • A Malicious Pattern Detection Engine for Embedded Security Systems in Internet of Things [Link]
  • Doohwan Oh, Deokho Kim, and Won Woo Ro
  • Sensors, Vol. 14, No. 12, pp. 24188-24211, Dec. 2014 (IF: 2.048, Q2, JCR2013)
  • C-Lock: Energy Efficient Synchronization for Embedded Multicore Systems [Link]
  • Seung Hun Kim, Sang Hyong Lee, Minje Jun, Byunghoon Lee, Won Woo Ro, Eui-Young Chung,
    and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 63, No. 8, pp. 1962-1974, Aug. 2014 (IF: 1.473, Q2, JCR2013)
  • Swarm Processor System: Hardware Process Scheduler based Energy Efficient Multi-Core System [Link]
  • Won Seob Jeong, Seung Hun Kim, Sang-Min Lee, and Won Woo Ro
  • IEICE Electronics Express, Vol. 11, No. 14, pp. 20140424, July 2014 (IF: 0.391, Q4, JCR2013)
  • Complexity-Effective Contention Management with Dynamic Backoff for Transactional Memory Systems [Link]
  • Seung Hun Kim, Dongmin Choi, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 63, No. 7, pp. 1696-1708, July 2014 (IF: 1.473, Q2, JCR2013)
  • Architectural Investigation of Matrix Data Layout on Multicore Processors [Link]
  • Minwoo Kim and Won Woo Ro
  • [SCI-Q1]  

    Future Generation Computer Systems, Vol. 37, pp. 64-75, July 2014 (IF: 2.639, Q1, JCR2013)
  • Exploiting Implementation Diversity and Partial Connection of Routers in Application-Specific Network-on-Chip Topology Synthesis [Link]
  • Minje Jun, Won Woo Ro, and Eui-Young Chung
  • IEEE Transactions on Computers, Vol. 63, No. 6, pp. 1434-1445, Jun. 2014 (IF: 1.473, Q2, JCR2013)
  • Accelerating MapReduce Framework on Multi-GPU Systems [Link]
  • Hai Jiang, Yi Chen, Zhi Qiao, Kuan-Ching Li, Won Woo Ro, and Jean-Luc Gaudiot
  • Cluster Computing, Vol. 17, No. 2, pp. 293-301, Jun. 2014 (IF: 0.949, Q3, JCR2013)
  • Boosting CUDA Applications with CPU-GPU Hybrid Computing [Link]
  • Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
  • International Journal of Parallel Programming, Vol. 42, No. 2, pp. 384-404, Apr. 2014 (IF: 0.500, Q4, JCR2013)
  • This is an extension of our INTERACT-16 paper which has been selected as one of the best papers and recommended to IJPP.

Conference Papers

  • LUT based Secure Cloud Computing - an Implementation using FPGAs [Link]
  • Lei Xu, Pham Dang Khoa, Seung Hun Kim, Won Woo Ro, and Weidong Shi
  • 2014 International Conference on ReConFigurable Computing and FPGAs
  • (ReConFig 2014) Cancun, Mexico, Dec. 7 - 10, 2014
  • Workload Synthesis: Generating Benchmark Workloads from Statistical Execution Profile [Link]
  • Keunsoo Kim, Changmin Lee, Jung Ho Jung, and Won Woo Ro
  • IEEE International Symposium on Workload Characterization
  • (IISWC 2014) Raleigh, North Carolina, USA, Oct. 26 - 28, 2014


2013

Journal Papers

  • Parallelized Sub-Resource Loading for Web Rendering Engine [Link]
  • Deokho Kim, Changmin Lee, Sangpil Lee, and Won Woo Ro
  • Journal of Systems Architecture, Vol. 59, No. 9, pp. 785-793, Oct. 2013 (IF: 0.724, Q3, JCR2012)
  • Design and Evaluation of Random Linear Network Coding Accelerators on FPGAs [Link]
  • Sunwoo Kim, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
  • ACM Transactions on Embedded Computing Systems, Vol.13, No. 1, pp. 1-24, Aug. 2013 (IF: 1.178, Q2, JCR2012)
  • GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table [Link]
  • Yunho Oh, Doohwan Oh, and Won Woo Ro
  • International Journal of Parallel Programming, Vol. 41, No. 4, pp. 526-551, Aug. 2013 (IF: 0.404, Q4, JCR2012)
  • A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems [Link]
  • Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
  • Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013 (IF: 1.953, Q3, JCR2012)
  • Exploiting SIMD Parallelism on Dynamically Partitioned Parallel Network Coding for P2P Systems [Link]
  • Deokho Kim, Karam Park, and Won Woo Ro
  • Computers & Electrical Engineering, Vol. 39, No. 1, pp. 55-56, Jan. 2013 (IF: 0.928, Q3, JCR2012)
  • Benefits of Using Parallelized Non-Progressive Network Coding [Link]
  • Minwoo Kim, Karam Park, and Won Woo Ro
  • [SCI-Q1]  

    Journal of Network and Computer Applications, Vol. 36, No. 1, pp. 293-305, Jan. 2013 (IF: 1.467, Q1, JCR2012)
  • Importance of Coherence Protocols with Network Applications on Multi-Core Processors [Link]
  • Kyueun Yi, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 62, No. 1, pp. 6-15, Jan. 2013 (IF: 1.379, Q2, JCR2012)

Conference Papers

  • XSD: Accelerating MapReduce by Harnessing the GPU inside an SSD [Link]
  • Benjamin Y. Cho, Won Seob Jeong, Doohwan Oh, and Won Woo Ro
  • The 1st Workshop on Near-Data Processing. In conjunction with the MICRO-46
  • (WoNDP 2013) Davis, USA, Dec. 8, 2013
  • Mark-Sharing: A Parallel Garbage Collection Algorithm for Low Synchronization Overhead [Link]
  • Hyunkyu Park, Changmin Lee, Seung Hun Kim, Won Woo Ro and Jean-Luc Gaudiot
  • The 19th IEEE International Conference on Parallel and Distributed Systems
  • (ICPADS 2013) Seoul, Korea, Dec. 15 - 18, 2013
  • MGMR: Multi-GPU Based MapReduce [Link]
  • Yi Chen, Zhi Qiao, Hai Jiang, Kuan-Ching Li, Won Woo Ro
  • The 8th International Conference on Grid and Pervasive Computing
  • (GPC 2013) Seoul, Korea, May. 9 - 11, 2013
  • Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism [Link]
  • Sangpil Lee and Won Woo Ro
  • The 2013 IEEE International Symposium on Performance Analysis of Systems and Software
  • (ISPASS 2013) Austin, TX, USA, Apr. 21 - 23, 2013


2012

Journal Papers

  • Multi-Threading and Suffix Grouping on Massive Multiple Pattern Matching Algorithm [Link]
  • Doohwan Oh and Won Woo Ro
  • The Computer Journal, Vol. 55, No. 11, pp. 1331-1346, Nov. 2012 (IF: 0.785, Q3, JCR2011)
  • Offloading of Media Transcoding for High-Quality Multimedia Services [Link]
  • Seung Hun Kim, Keunsoo Kim, Changmin Lee, and Won Woo Ro
  • IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, pp. 691-699, May 2012 (IF: 0.941, Q3, JCR2011)
  • Design of a Power-Efficient Parallel Pipelined Bloom Filter [Link]
  • Deokho Kim, Doohwan Oh, and Won Woo Ro
  • Electronics Letters, Vol. 48, No. 7, pp. 367-369, Mar. 2012 (IF: 0.965, Q3, JCR2011)
  • Reconfigurable and Parallelized Network Coding Decoder for VANETs [Link]
  • Sunwoo Kim and Won Woo Ro
  • [SCI-Q1]  

    Mobile Information Systems, Vol. 8, No. 1, pp. 45-59, Feb. 2012 (IF: 2.432, Q1, JCR2011)
  • Accelerated Network Coding with Dynamic Stream Decomposition on Graphics Processing Unit [Link]
  • Sangpil Lee and Won Woo Ro
  • The Computer Journal, Vol. 55, No. 1, pp. 21-34, Jan. 2012 (IF: 0.785, Q3, JCR2011)

Conference Papers

  • Conflict Avoidance Scheduling using Grouping List for Transactional Memory [Link]
  • Dongmin Choi, Seung Hun Kim, and Won Woo Ro
  • The 17th International Workshop on High-Level Parallel Programming Models and Supportive Environments
  • (HIPS-17) Shanghai, China, May 21, 2012
  • Cooperative Heterogeneous Computing for Parallel Processing on CPU/GPU Hybrids [Link]
  • Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
  • The 16th Workshop on Interaction between Compilers and Computer Architectures
  • (INTERACT-16) New Orleans, USA, Feb. 25 - 29, 2012


2011

Journal Papers

  • A Novel Sequential Tree Algorithm Based on Scoreboard for MPI Broadcast Communication [Link]
  • Won-young Chung, Jae-won Park, Seung-Woo Lee, Won Woo Ro, and Yong-surk Lee
  • IEICE Transactions on Information and Systems, Vol 94, No. 12, pp. 2523-2527, December. 2011 (IF: 0.268, Q4, JCR2010)
  • Network Coding on Heterogeneous Multi-Core Processors for Wireless Sensor Networks [Link]
  • Deokho Kim, Karam Park, and Won W. Ro
  • Sensors, Vol 11, No. 8, pp. 7908-7933, Aug. 2011 (IF: 1.774, Q3, JCR2010)
  • A Low-Cost Standard Mode MPI Hardware Unit for Embedded MPSoC [Link]
  • Won-Young Chung, Ha-Young Jeong, Won W. Ro, and Yong-Surk Lee
  • IEICE Transactions on Information and Systems, Vol. E94-D, No.7, pp. 1497-1501, July 2011 (IF: 0.268, Q4, JCR2010)

Conference Papers

  • Parallel Transpose of Matrix Multiplication Based on the Tiling Algorithm [Link]
  • Minwoo Kim, Yong J. Jang, and Won W. Ro
  • The 54th IEEE International Midwest Symposium on Circuits and Systems
  • (MWSCAS 2011) Seoul, Korea, Aug. 7 - 10, 2011
  • Performance Evaluation of Adaptive Progressive Network Coding [Link]
  • Deokho Kim, Karam Park, and Won W. Ro
  • The 54th IEEE International Midwest Symposium on Circuits and Systems
  • (MWSCAS 2011) Seoul, Korea, Aug. 7 - 10, 2011


2010

Journal Papers

  • Multithreaded Pattern Matching Algorithm with Data Rearrangement [Link]
  • Doohwan Oh, Seung Hun Kim, and Won W. Ro
  • IEICE Electronics Express, Vol. 7, No. 20, pp. 1520-1526, Oct. 2010 (IF: 0.510, Q3, JCR2009)
  • On Improving Parallelized Network Coding with Dynamic Partitioning [Link]
  • Karam Park, Joon-Sang Park, and Won W. Ro
  • [SCI-Q1]  

    IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 11, pp. 1547-1560, Nov. 2010 (IF: 1.733, Q1, JCR2009)
  • Hardware Implementation of a Tessellation Accelerator for the OpenVG Standard [Link]
  • Seung Hun Kim, Yunho Oh, Karam Park, and Won W. Ro
  • IEICE Electronics Express, Vol. 7, No. 6, pp. 440-446, Mar. 2010 (IF: 0.510, Q3, JCR2009)

Conference Papers

  • Development of Virtual CUDA Systems of Parallel Processing on CPU and GPGPU [Link]
  • Doohwan Oh, Sangpil Lee, Deokho Kim, Changmin Lee, and Won W. Ro
  • Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds In Conjunction with MICRO 2010
  • (MASVDC Workshop 2010) Atlanta, USA, Dec. 5, 2010
  • Implementing FFT using SPMD style of OpenMP [Link]
  • Tien-Hsiung Weng, Sheng-Wei Huang, Won Woo Ro, and Kuan-Ching Li
  • In Proc. of the 6th International Conference on Networked Computing and Advanced Information Management
  • (NCM 2010) Seoul, Korea, Aug. 16 - 18, 2010
  • Accelerated Reconstruction Using Parallel Computing for Spiral Spectroscopic Imaging [Link]
  • Dong H. Kim, Yun H. Oh, Yun H. Nam, M. Gu, and Won W. Ro
  • In Proc. of 2010 International Society for Magnetic Resonance in Medicine Annual Meeting
  • (2010 ISMRM Annual Meeting) Stockholm, Sweden, May 1 - 7, 2010
  • FPGA Implementation of Highly Parallelized Decoder Logic for Network Coding [Link]
  • Sunwoo Kim and Won W. Ro
  • In Proc. of Eighteenth ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
  • (FPGA 2010) Monterey, USA, Feb. 21 - 23, 2010


2009

Journal Papers

  • A Complexity-Effective Microprocessor Design with Decoupled Dispatch Queues and Prefetching [Link]
  • Won W. Ro and Jean-Luc Gaudiot
  • Parallel Computing, Vol. 35, No. 5, pp. 255-268, May 2009 (IF: 1.309, Q2, JCR2008)

Conference Papers

  • Evaluation of Cache Coherence Protocols on Multi-Core Systems with Linear Workloads [Link]
  • Yong J. Jang and Won W. Ro
  • In Proc. of 2009 International Colloquium on Computing, Communication, Control, and Management
  • (CCCM 2009)Sanya, China, Aug. 8 - 9, 2009
  • Comparing Open Source Web Services: gSoap and AXIS [Link]
  • Jongwook Woo and Won W. Ro
  • In Proc. of the 24th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2009)Jeju Island, Korea, July 5 - 8, 2009
  • Efficient Parallelized Network Coding for P2P File Sharing Applications [Link]
  • Karam Park, Joon-Sang Park, and Won W. Ro
  • In Proc. of the 4th International Conference on Grid and Pervasive Computing
  • (GPC 2009)Geneva, Switcherland, May 4 - 8, 2009
  • Fully Pipelined Hardware Implementation of 128-bit SEED Block Cipher Algorithm [Link]
  • Jaeyoung Yi, Karam Park, Joonseok Park, and Won W. Ro
  • In Proc. of the 5th International Workshop on Applied Reconfigurable Computing
  • (ARC 2009)Karlsruhe, Germany, Mar. 16 - 18, 2009

Book Chapters

  • Programmability and Scalability on Multi-Core Architectures [Link]
  • Jaeyoung Yi, Yong J. Jang, Doohwan Oh, and Won W. Ro
  • Chapter in "Handbook of Research on Scalable Computing Technologies", edited by Kuan-Ching Li, Ching-Hsien Hsu, Laurence Tianruo Yang, Jack Dongarra, and Hans Zima, Information Science Reference, 2009


2008

Journal Papers

  • Efficient Peer-to-Peer File Sharing Using Network Coding in MANET [Link]
  • Uichin Lee, Joon-Sang Park, Seung-Hoon Lee, Won W. Ro, Giovanni Pau, and Mario Gerla
  • Journal of Communications and Networks, Vol. 10, No. 4, Dec. 2008 (IF: 0.223, Q4, JCR2007)
  • A Low-Complexity Microprocessor Design with Speculative Pre-Execution [Link]
  • Won W. Ro and Jean-Luc Gaudiot
  • Journal of Systems Architecture, Vol. 54, No. 12, pp. 1101-1112, Dec. 2008 (IF: 0.490, Q3, JCR2007)
  • Performance Evaluation of Programming Models for SMP-Based Clusters [Link]
  • Myungho Lee, Neungsoo Park, Won W. Ro, and Kuan-Ching Li
  • Journal of the Chinese Institute of Engineers, Vol. 31, No. 7, pp. 1181-1188, Dec. 2008 (IF: 0.183, Q4, JCR2007)
  • Simultaneous Thin-Thread Processors for Low-Power Embedded Systems [Link]
  • Won W. Ro, Jaeyoung Yi, Joon-Sang Park, and Joonseok Park
  • IEICE Electronics Express, Vol. 5, No. 19, pp. 802-808, Oct. 2008 (IF: 0.436, Q3, JCR2007)
  • Delay Analysis of Car-to-Car Reliable Data Delivery Strategies Based on Data Mulling with Network Coding [Link]
  • Joon-Sang Park, Uichin Lee, Soon Young Oh, Mario Gerla, Desmond Siumen Lun, Won W. Ro, and Joonseok Park
  • IEICE Transactions on Information and Systems, Vol. E91-D, No. 10, Oct. 2008 (IF: 0.245, Q4, JCR2007)

Conference Papers

  • Parallel Algorithms for Steiner Tree Problem [Link]
  • Joon-Sang Park, Won W. Ro, Handuck Lee, and Neungsoo Park
  • In Proc. of the 3rd International Conference on Convergence and Hybrid Information Technology
  • (ICHIT 2008)Busan, Korea, Nov. 11 - 13, 2008


2006

Journal Papers

  • Design and Evaluation of a Hierarchical Decoupled Architecture [Link]
  • Won W. Ro, Stephen P. Crago, Alvin M. Despain, and Jean-Luc Gaudiot
  • Journal of Supercomputing, Springer, Vol. 38, No. 3, pp. 237-259, Dec. 2006 (IF: 0.482, Q3, JCR2005)
  • Speculative Pre-Execution Assisted by Compiler (SPEAR) [Link]
  • Won W. Ro and Jean-Luc Gaudiot
  • Journal of Parallel and Distributed Computing, Elsevier, Vol. 66, No. 8, pp. 1076-1089, Aug. 2006 (IF: 0.900, Q2, JCR2005)

Conference Papers

  • Design and Effectiveness of Small-Sized Decoupled Dispatch Queues [Link]
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of European Conference on Parallel Computing - LNCS
  • (EURO-PAR 2006) Dresden, Germany, Aug. 29 - Sep. 1, 2006


2005

Conference Papers

  • A Low-Complexity Issue Queue Design with Speculative Pre-Execution [Link]
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 12th International Conference on High Performance Computing
  • (HiPC 2005) Goa, India, Dec. 18 - 21, 2005

Book Chapters

  • Techniques to Improve Performance Beyond Pipelining: Superpipelining, Superscalar, and VLIW [Link]
  • Jean-Luc Gaudiot, Jung-Yup Kang, and Won Woo Ro
  • Chapter in "Computer Architecture", a volume of "Advance in Computers", edited by Ali R.Hurson, Elsevier, 2005


2004

Conference Papers

  • SPEAR: A Hybrid Model for Speculative Pre-Execution [Link]
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 18th International Parallel and Distributed Processing Symposium
  • (IPDPS 2004)Santa Fe, New Mexico, 2004


2003

Conference Papers

  • HiDISC: A Decoupled Architecture for Data-Intensive Applications [Link]
  • Won W. Ro, Jean-Luc Gaudiot, Stephen P. Crago, and Alvin M. Despain
  • In Proc. of the 17th International Parallel and Distributed Processing Symposium
  • (IPDPS 2003)Nice, France, Apr. 22 - 26, 2003
  • Compiler Support for Dynamic Speculative Pre-Execution [Link]
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 7th Annual Workshop on Interaction between Compilers and Computer Architectures
  • (INTERACT-7) in conjunction with HPCA-9 Anaheim, California, Feb. 8, 2003


2000

Conference Papers

  • Memory Latency: to Tolerate or to Reduce? [Link]
  • Amol Bakshi, Jean-Luc Gaudiot, Wen-Yen Lin, Manil Makhija, Viktor K. Prasanna, Wonwoo Ro, and Chulho Shin
  • In Proc. of the 12th Symposium on Computer Architecture and High Performance Computing
  • (SBAC-PAD'00) Sao Pedro, Brazil, Oct. 24 - 27, 2000
  • A High-Performance, Hierarchical Decoupled Architecture [Link]
  • Stephen P. Crago, Alvin Despain, Jean-Luc Gaudiot, Manil Makhija, Wonwoo Ro, and Apoorv Srivastava
  • In Proc. of the Memory access Decoupling for superscalar and multiple issue Architectures
  • (MEDEA) Workshop in conjunction with PACT 2000 Philadelphia, Oct. 15, 2000
  • A Reliable Cluster Computing with a New Checkpointing RAID-x Architecture [Link]
  • Kai Hwang, Hai Jin, Roy Ho, and Wonwoo Ro
  • In Proc. of the 9th Heterogeneous Computing Workshop
  • (HCW) Cancun, Mexico, May 1, 2000

All Publications


2024

In Press


Journal Papers

  • SHREG: Mitigating Register Redundancy in GPUs
  • Seunghyun Jin, Hyunwuk Lee, Jonghyun Lee, Junsung Kim, and Won Woo Ro
  • Journal of Systems Architecture Vol. 145, Mar. 2024

Conference Papers

  • Geneva: A Dynamic Confluence of Speculative Execution and In-Order Commitment Windows
  • Yanghee Lee, Jiwon Lee, Jaewon Kwon, Yongju Lee, and Won Woo Ro
  • The 61th Design Automation Conference
  • (DAC 2024)
  • Systolic Array Architecture Supporting Multiple Scaling Factors for U-Net Quantization
  • Hyunwuk Lee and Won Woo Ro
  • The 23th International Conference on Electronics, Information, and Communication
  • (ICEIC 2024)
  • Evaluating Performance of Shared On-Chip Caches in Multi-GPUs
  • Gun Ko and Won Woo Ro
  • The 23th International Conference on Electronics, Information, and Communication
  • (ICEIC 2024)
  • A Multi-DNN Acceleration Architecture for Balanced QoS and Throughput
  • Ipoom Jeong, Sungji Choi, Minjae Kim, Enhyeok Jang, Seokjin Go, and Won Woo Ro
  • The 23th International Conference on Electronics, Information, and Communication
  • (ICEIC 2024)
  • Integrated Framework Design Methodologies to Support Processing-In-Memory Platforms
  • Enhyeok Jang, Hongju Kal, Jaewon Kwon, and Won Woo Ro
  • The 23th International Conference on Electronics, Information, and Communication
  • (ICEIC 2024)
  • REPrune: Channel Pruning via Kernel Representative Selection
  • Mincheol Park, Dongjin Kim, Cheonjun Park, Yuna Park, Gyeong Eun Gong, Won Woo Ro, and Suhyun Kim
  • The 38th AAAI Conference on Artificial Intelligence
  • (AAAI 2024)

2023


Journal Papers

  • A Convertible Neural Processor Supporting Adaptive Quantization for Real-Time Neural Networks
  • Hongju Kal, Hyoseong Choi, Ipoom Jeong, Joon-Sung Yang, and Won Woo Ro
  • Journal of Systems Architecture Vol. 145, Nov. 2023

Conference Papers

  • INTERPRET: Inter-Warp Register Reuse for GPU Tensor Cores
  • Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, and Won Woo Ro
  • The 32th International Conference on Parallel Architectures and Compilation Techniques
  • (PACT 2023)
  • McCore: A Holistic Management of High-Performance Heterogeneous Multicores
  • Jaewon Kwon, Yongju Lee, Hongju Kal, Minjae Kim, Youngsok Kim, and Won Woo Ro
  • The 56th International Symposium on Microarchitecture
  • (MICRO 2023)
  • AESPA: Asynchronous Execution Scheme to Exploit Bank-Level Parallelism of Processing-in-Memory
  • Hongju Kal, Chanyoung Yoo, and Won Woo Ro
  • The 56th International Symposium on Microarchitecture
  • (MICRO 2023)
  • MAD MAcce: Supporting Multiply-Add Operations for Democratizing Matrix-Multiplication Accelerator
  • Seunghwan Sung, Sujin Hur, Dongho Ha, Sungwoo Kim, Yunho Oh, and Won Woo Ro
  • The 56th International Symposium on Microarchitecture
  • (MICRO 2023)
  • Exploiting Inherent Properties of Complex Numbers for Accelerating Complex Valued Neural Networks
  • Hyunwuk Lee, Hyungjun Jang, Sungbin Kim, Sungwoo Kim, Wonho Cho, and Won Woo Ro
  • The 56th International Symposium on Microarchitecture
  • (MICRO 2023)
  • Performance Analysis of Criticality-Aware Out-of-Order Cores for Exploiting MLP
  • Yanghee Lee, Jiwon Lee, and Won Woo Ro
  • The 38th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2023)
  • Adaptive Data Prefetcher with Probability Learning in LLC
  • Jusin Kim, Jiwon Lee, and Won Woo Ro
  • The 38th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2023)
  • Context Swap: Multi-PIM System Preventing Remote Memory Access for Large Embedding Model Acceleration
  • Hongju Kal, Cheolhwan Kim, Minjae Kim, and Won Woo Ro
  • The 2023 IEEE International Conference on Artificial Intelligence Circuits and Systems
  • (AICAS 2023)
  • TensorCV: Accelerating Non-AI/ML Stages in Computing Vision Pipelines using Tensor Processors
  • Dongho Ha, Won Woo Ro, and Hung-Wei Tseng
  • The 2023 ACM/IEEE International Symposium on Low Power Electronics and Design
  • (ISLPED 2023)
  • R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs
  • Donho Ha, Yunho Oh, and Won Woo Ro
  • The 50th International Symposium on Computer Architecture
  • (ISCA 2023)
  • Early-Adaptor: An Adaptive Framework for Proactive UVM Memory Management
  • Seokjin Go, Hyunwuk Lee, Junsung Kim, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
  • The 2023 IEEE International Symposium on Performance Analysis of Systems and Software
  • (ISPASS 2023)
  • Lightning Talk: Efficiency and Programmability of DNN Accelerators and GPUs
  • Won Woo Ro
  • The 60th ACM/IEEE Design Automation Conference
  • (DAC 2023)
  • Quixote: Improving Fidelity of Quantum Program by Independent Execution of Controlled Gates
  • Enhyeok Jang, Seungwoo Choi, and Won Woo Ro
  • The 60th ACM/IEEE Design Automation Conference
  • (DAC 2023)
  • Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism
  • Cheonjun Park, Mincheol Park, Hyun Jae Oh, Minkyu Kim, Myung Kuk Yoon, Suhyun Kim, and Won Woo Ro
  • The 37th AAAI Conference on Artificial Intelligence
  • (AAAI 2023)
  • SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs
  • Jiwon Lee, Ju Min Lee, Yunho Oh, William J. Song, and Won Woo Ro
  • The 29th IEEE International Symposium on High-Performance Computer
  • (HPCA 2023)
  • Analysis on Memory Access Patterns of Server-Class Workloads in Page- and Cache Line- Granularity
  • Kyeonghoon Lim, Minjae Kim, Jiwon Lee, and Won Woo Ro
  • The 22th International Conference on Electronics, Information, and Communication
  • (ICEIC-2023)
  • Enabling Heterogeneous Memory System over CXL
  • Dongin Lee, Sungbin Kim, Hyungjun Jang, Sungwoo Kim, and Won Woo Ro
  • The 22th International Conference on Electronics, Information, and Communication
  • (ICEIC-2023)
  • Investigation on NVIDIA Ampere GPU Architecture with Reverse Engineering
  • Sujin Hur, Seunghwan Sung, Dongho Ha, Sungwoo Kim, and Won Woo Ro
  • The 22th International Conference on Electronics, Information, and Communication
  • (ICEIC-2023)


2022


Journal Papers

  • TEA-RC: Thread Context-Aware Register Cache for GPUs
  • Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
  • Accepted to IEEE Access
  • CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs
  • Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
  • Accepted to IEEE Embedded Systems Letters
  • (IEEE ESL)
  • FLIXR: Embedding Index into Flash Translation Layer in SSDs
  • Gunjae Koo, Yunho Oh, Hung-Wei Tseng, Won Woo Ro, and Murali Annavaram
  • Accepted to IEEE Transactions on Computers

Conference Papers

  • 다종의 프로세싱 인 메모리 구조를 활용하기 위한 BLAS 기반의 프레임 워크 구현
  • 유찬영, 장은혁, 갈홍주, 노원우
  • 대한전자공학회 추계학술대회
  • Reconstructing Out-of-Order Issue Queue
  • Ipoom Jeong, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
  • The 55th IEEE/ACM International Symposium on Microarchitecture
  • (MICRO 2022)
  • Analysis of SSD with Logical to Physical Address Mapping of Hot Data to Single Level Cell Area
  • Gyuseok Choe, Youngmin Lee, and Won Woo Ro
  • The 37th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2022)
  • Analysis of DRAM-based Network of DRAM Swap Space Adopting Latency Hiding Technique
  • Hyoseong Choi, Jiwon Lee, Jeonghoon Choi, and Won Woo Ro
  • The 37th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2022)
  • PR3D: Processing Recommendation Systems in 3D-Stacked DRAM Adopting Heterogeneous Data Format
  • Chanyoung Yoo, Hongju Kal, and Won Woo Ro
  • The 21th International Conference on Electronics, Information, and Communication
  • (ICEIC-2022)


2021


Journal Papers

  • Two-Stage In-Storage Processing and Scheduling for Pattern Matching Applications
  • Joohyeong Yoon, Yoonjin Lee, Won Seob Jeong, and Won Woo Ro
  • IEEE Access, Vol. 9, pp. 95702-95715, Jun. 2021
  • PIMCaffe: Functional Evaluation of a Machine Learning Framework for In-Memory Neural Processing Unit
  • Won Jeon, Jiwon Lee, Dongseok Kang, Hongju Kal, and Won Woo Ro
  • IEEE Access, Vol. 9, pp. 96629-96640, Jul. 2021

Conference Papers

  • SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations
  • Hongju Kal, Seokmin Lee, Gun Ko, and Won Woo Ro
  • The 48th ACM/IEEE International Symposium on Computer Architecture
  • (ISCA-2021)
  • Analysis of GPU Scheduling Technique for Convergence Barrier
  • Jae Seok Kwak and Won Woo Ro
  • The 20th International Conference on Electronics, Information, and Communication
  • (ICEIC-2021)
  • Delay Analysis on Tensor Access Patterns of CNN Algorithms
  • Jonathan Robert Malin and Won Woo Ro
  • The 20th International Conference on Electronics, Information, and Communication
  • (ICEIC-2021)
  • Detecting Pattern of Warp Register Value Differences in CTA using GPU Compiler
  • Dongho Ha and Won Woo Ro
  • The 20th International Conference on Electronics, Information, and Communication
  • (ICEIC-2021)
  • Analysis of Multiple-Application Support Techniques in GPU
  • Jonghyun Lee and Won Woo Ro
  • The 6th International Conference On Consumer Electronics (ICCE) Asia
  • (ICCE-ASIA 2021)
  • Analysis of Key-Value SSD to Improve the Performance of Key-Value Store System
  • Gyuseok Choe, Jeonghoon Choi and Won Woo Ro
  • The 6th International Conference On Consumer Electronics (ICCE) Asia
  • (ICCE-ASIA 2021)
  • QoS-Aware Scheduling for Cellular Networks Using Deep Reinforcement Learning
  • Jonathan Robert Malin, Gun Ko and Won Woo Ro
  • The 18th IFIP International Conference on Network and Parallel Computing
  • (NPC 2021)


2020


Journal Papers

  • Hi-End: Hierarchical, Endurance-Aware STT-MRAM-Based Register File for Energy-Efficient GPUs
  • Won Jeon, Jun Hyun Park, Yoonsoo Kim, Gunjae Koo, and Won Woo Ro
  • IEEE Access, Vol. 8, pp. 127768-127780, Jul. 2020
  • REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing
  • Won Seob Jeong, Changmin Lee, Keunsoo Kim, Myung Kuk Yoon, Won Jeon, Myoungsoo Jung, and Won Woo Ro
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 31, Issue 5, pp.1137-1151, May. 2020

Conference Papers

  • BENEFIT: Basic Linear Algebra Subprogram and Neural Network framework for FPGA-based Neural Processing Units
  • Dongseok Kang and Won Woo Ro
  • The Fifth International Conference On Consumer Electronics Asia
  • (ICCE-ASIA 2020)
  • Busan, Korea, Nov. 1 - 3, 2020
  • OASIS: Overhead Analysis of Systolic Neural Processing Unit on LSTM
  • Byunghwy Choi and Won Woo Ro
  • The Fifth International Conference On Consumer Electronics Asia
  • (ICCE-ASIA 2020)
  • Busan, Korea, Nov. 1 - 3, 2020
  • Interaction Data Analysis for Personalized Recommendation System
  • Seokmin Lee and Won Woo Ro
  • The Fifth International Conference On Consumer Electronics Asia
  • (ICCE-ASIA 2020)
  • Busan, Korea, Nov. 1 - 3, 2020
  • BODCA: Heterogeneous CPU-GPU computing system with Bandwidth-Optimized DRAM cache design
  • Sungji Choi and Won Woo Ro
  • The Fifth International Conference On Consumer Electronics Asia
  • (ICCE-ASIA 2020)
  • Busan, Korea, Nov. 1 - 3, 2020
  • Duplo: Lifting Redundant Memory Accesses of Neural Networks for GPU Tensor Cores
  • Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, and William J. Song
  • The 53rd IEEE/ACM International Symposium on Microarchitecture
  • (MICRO 2020)
  • Virutal Conference, Oct. 17 - Oct. 21, 2020
  • Check-In: In-Storage Checkpointing for Key-Value Store System Leveraging Flash-Based SSDs
  • Joohyeong Yoon, Won Seob Jeong, and Won Woo Ro
  • The 47th ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2020)
  • Virutal Conference, May. 29 - Jun. 3, 2020
  • CASINO Core Microarchitecture: Generating Out-of-Order Schedules Using Cascaded In-Order Scheduling Windows
  • Ipoom Jeong, Seihoon Park, Changmin Lee, and Won Woo Ro
  • The 26th International IEEE Symposium on High Performance Computer Architecture
  • (HPCA 2020)
  • San Diego, CA, USA, Feb. 22 - 26, 2020
  • Self-controllable refresh target row skip and inclusion technique for the intelligent DRAM
  • Jaein Song and Won Woo Ro
  • The 19th International Conference on Electronics, Information and Communication
  • (ICEIC 2020)
  • Access Characteristic-based Cache Replacement Policy in an SSD
  • Joohyeong Yoon and Won Woo Ro
  • The 19th International Conference on Electronics, Information and Communication
  • (ICEIC 2020)


2019


Journal Papers

  • OverCome: Coarse-Grained Instruction Commit with Handover Register Renaming
  • Ipoom Jeong, Changmin Lee, Keunsoo Kim, and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 68, Issue 12, pp. 1802-1816, Dec. 2019
  • Contents-Aware Partitioning Algorithm for Parallel High Efficiency Video Coding
  • Kyungah Kim and Won Woo Ro
  • Multimedia Tools and Applications, Multimedia Tools and Applications, Vol. 78, Issue 9, pp. 11427-11442, May. 2019
  • Fast CU Depth Decision for HEVC using Neural Networks
  • Kyungah Kim and Won Woo Ro
  • IEEE Transactions on Circuits and Systems for Video Technology, Vol. 29, No. 5, pp. 1462-1473, May. 2019
  • Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs
  • Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 68, No. 4, pp. 609-616, Apr. 2019

Conference Papers

  • Efficient Dilated-Winograd Convolutional Neural Networks
  • Minsik Kim, Cheonjun Park, Sungjun Kim, Taeyoung Hong, and Won Woo Ro
  • The 2019 IEEE International Conference on Image Processing, Accepted
  • Performance Scalability Limit of PARSEC Benchmark on a Many-Core Processor
  • Won Seob Jeong and Won Woo Ro
  • The 34th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2019)
  • Jeju, Korea, Jun. 23 - 26, 2019
  • Analysis of SSD Internal DRAM Sensitivity for a Key-Value Store
  • Yongseok Won, Yoonjin Lee, Won Seob Jeong, and Won Woo Ro
  • The 34th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2019)
  • Jeju, Korea, Jun. 23 - 26, 2019
  • Exploiting GPU hierarchical TLB in Multi-Application Execution
  • Hyun Jae Oh, Won Jeon, and Won Woo Ro
  • The 34th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2019)
  • Jeju, Korea, Jun. 23 - 26, 2019
  • Hierarchical, Compressed STT-MRAM Register File for GPU
  • Jun Hyun Park and Won Woo Ro
  • The 34th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2019)
  • Jeju, Korea, Jun. 23 - 26, 2019
  • Linebacker: Preserving Victim Cache Lines in Idle Register Files of GPUs
  • Yunho Oh, Gunjae Koo, Murali Annavaram, and Won Woo Ro
  • The 46th ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2019)
  • Phoenix, Arizona, USA, Jun. 22 - 26, 2019
  • Analysis of SSD Internal Cache Problem in a Key-Value Store System
  • Won Seob Jeong, Yongseok Won, and Won Woo Ro
  • The 2nd International Conference on Big Data and Smart Computing
  • (ICBDSC 2019)
  • Bali, Indonesia. Jan. 10 - 13, 2019


2018


Journal Papers

  • 고성능 그래픽 처리 장치 발전 동향
  • 하동호, 이현욱, 이지원, 오현재, 전원, 오윤호, 노원우
  • 한국정보과학회 정보과학회지
  • WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs
  • Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 67, No. 9, pp. 1366-1373, Sep. 2018
  • Exploiting Pseudo-Quadtree Structure for Accelerating HEVC Spatial Resolution Downscaling Transcoder
  • Minsik Kim, Minyong Sung, Minwoo Kim, and Won Woo Ro
  • IEEE Transactions on Multimedia, Vol. 20, No. 9, pp. 2262-2275, Sep. 2018
  • Architectural Protection of Application Privacy against Software and Physical Attacks in Untrusted Cloud Environment
  • Lei Xu, JongHyuk Lee, Seung Hun Kim, Qingji Zheng, Shouhuai Xu, Taeweon Suh, Won Woo Ro, and Weidong Shi
  • IEEE Transactions on Cloud Computing, Vol. 6, No. 2, pp. 478-491, Apr-Jun. 2018
  • Simultaneous and Speculative Thread Migration for Improving Energy Efficiency of Heterogeneous Core Architectures
  • Changmin Lee and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 67, No. 4, pp. 498-512, Apr. 2018

Conference Papers

  • Region of Interest based Frame Rate Up-Conversion using Encoded Bit-stream
  • Kyungah Kim and Won Woo Ro
  • International Conference on Communication, Image and Signal Processing
  • (CCISP 2018)
  • Sanya, China. Nov. 16 - 18, 2018
  • FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput
  • Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
  • The 51st IEEE/ACM International Symposium on Microarchitecture
  • (MICRO 2018)
  • Fukuoka, Japan, Oct. 20 - 24, 2018
  • Fast Intra LCU Decision using Deep Neural Networks
  • Kyungah Kim and Won Woo Ro
  • The International Conference On Big data, IoT, and Cloud Computing
  • (BIC-18)
  • Jeju, Korea, Aug. 20 - 22, 2018
  • Near-Data Processing Optimization for Efficient Neural Network Computations
  • Sungwoo Ahn, Won Jeon, and Won Woo Ro
  • The 3rd International Conference On Consumer Electronics Asia
  • (ICCE-ASIA 2018)
  • Jeju, Korea, Jun. 24 - 26, 2018
  • Constructing Resilient Region in Dynamic Optimization Systems via Dynamic Adjustment of Bias Thresholds
  • Ipoom Jeong and Won Woo Ro
  • The 3rd International Conference On Consumer Electronics Asia
  • (ICCE-ASIA 2018)
  • Jeju, Korea, Jun. 24 - 26, 2018
  • WIR: Warp Instruction Reuse to Minimize Repeated Computations in GPUs
  • Keunsoo Kim and Won Woo Ro
  • The 24th International IEEE Symposium on High Performance Computer Architecture
  • (HPCA 2018)
  • Wien, Austria, Feb. 24 - 28, 2018
  • Efficient and Reliable NAND Flash Channel for High-Speed Solid State Drives
  • Joohyeong Yoon, Won Seob Jeong, Won Jeon, and Won Woo Ro
  • The 17th International Conference on Electronics, Information and Communication
  • (ICEIC 2018)
  • Honolulu, HI, USA, Jan. 24 - 27, 2018
  • Fast Robot Software Framework with Object-Oriented Design
  • Heekuk Lee, Keunsoo Kim, and Won Woo Ro
  • The 17th International Conference on Electronics, Information and Communication
  • (ICEIC 2018)
  • Honolulu, HI, USA, Jan. 24 - 27, 2018


2017


Journal Papers

  • Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs
  • Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 28, No. 11, pp. 3142-3156, Nov. 2017
  • Dynamic Load Balancing of Dispatch Scheduling for Solid State Disks
  • Myunghyun Jo and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 66, No. 6, pp. 1034-1047, Jun. 2017
  • Improving Energy Efficiency of GPUs through Data Compression and Compressed Execution
  • Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Murali Annavaram, and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 66, No. 5, pp. 834-847, May 2017

Conference Papers

  • Parallel In-Order Execution Architecture for Low-Power Processor
  • Kyungmin Lee, Ipoom Jeong, and Won Woo Ro
  • The 14th International SoC Design Conference
  • (ISOCC 2017)
  • Seoul, Korea, Nov. 5 - 8, 2017
  • Characterizing Convolutional Neural Network Workloads on a Detailed GPU Simulator
  • Kwanghee Chang, Minsik Kim, Kyungah Kim, and Won Woo Ro
  • The 14th International SoC Design Conference
  • (ISOCC 2017)
  • Seoul, Korea, Nov. 5 - 8, 2017
  • Access Pattern-Aware Cache Management for Improving Data Utilization in GPU
  • Gunjae Koo, Yunho Oh, Won Woo Ro, and Murali Annavaram
  • The 44th ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2017)
  • Torronto, Canada, Jun. 24 - 28, 2017
  • Dynamic Warp Scheduler Selection Policy Using Linear Regression for GPUs
  • Hyunjune Shin, Kyungmin Lee, Ipoom Jeong, Jong Hyun Park, and Won Woo Ro
  • The 16th International Conference on Electronics, Information and Communication
  • (ICEIC 2017)
  • Phuket, Thailand, Jan. 11 - 14, 2017
  • Exploiting L2 Cache Sensitivity in Artificial Neural Network on GPUs
  • Seihoon Park, Yoonsoo Kim, Minsik Kim, and Won Woo Ro
  • The 16th International Conference on Electronics, Information and Communication
  • (ICEIC 2017)
  • Phuket, Thailand, Jan. 11 - 14, 2017
  • Optimizing Intersection and Reflection Step of Geometrical Optics using GPUs
  • Hyun Jin Chung, Myung Kuk Yoon, and Won Woo Ro
  • The 16th International Conference on Electronics, Information and Communication
  • (ICEIC 2017)
  • Phuket, Thailand, Jan. 11 - 14, 2017
  • Analysis of Error Tolerance in Convolution Neural Networks
  • Sangheon Kwon, Jong Hyun Park, and Won Woo Ro
  • The 16th International Conference on Electronics, Information and Communication
  • (ICEIC 2017)
  • Phuket, Thailand, Jan. 11 - 14, 2017


2016


Journal Papers

  • Server Side, Play Buffer Based Quality Control for Adaptive Media Streaming
  • Keunsoo Kim, Benjamin Y. Cho, and Won Woo Ro
  • Multimedia Tools and Applications, Vol. 75, No. 10, pp. 5397-5415, May 2016
  • Exploiting Thread-Level Parallelism on HEVC by Employing Reference Dependency Graph
  • Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
  • IEEE Transactions on Circuits and Systems for Video Technology, Vol. 26, No. 4, pp. 736-749, Apr. 2016
  • Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction
  • Sangpil Lee and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 65, No. 4, pp. 1253-1265, Apr. 2016

Conference Papers

  • Measuring Error-Tolerance in SRAM Architecture on Hardware Accelerated Neural Network
  • Sangheon Kwon, Kyungmin Lee, Yoonsoo Kim, Kyungah Kim, Changmin Lee, and Won Woo Ro
  • The 1st IEEE International Conference on Consumer Electronics Asia
  • (ICCE-ASIA 2016)
  • Seoul, Korea, Oct. 26 - 28, 2016
  • Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit
  • Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2016)
  • Seoul, Korea, Jun. 18 - 22, 2016
  • APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs
  • Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2016)
  • Seoul, Korea, Jun. 18 - 22, 2016
  • Warped-Slicer: Efficient Intra-SM Slicing through Dynamic Resource Partitioning for GPU Multiprogramming
  • Qiumin Xu, Hyeran Jeon, Keunsoo Kim, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2016)
  • Seoul, Korea, Jun. 18 - 22, 2016
  • Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding
  • Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
  • The 22nd International IEEE Symposium on High Performance Computer Architecture
  • (HPCA 2016)
  • Barcelona, Spain, Mar. 12 - 16, 2016
  • Accelerating Forwading Computation of ANN using CUDA
  • Jong Hyun Park and Won Woo Ro
  • The 15th International Conference on Electronics, Information and Communication
  • (ICEIC 2016)
  • Danang, Vietnam, Jan. 27 - 30, 2016
  • Fairness-Aware Thread Scheduling for Multithreaded Program using Intel Software Guarded Extensions
  • Won Jeon, Seung Hun Kim, and Won Woo Ro
  • The 15th International Conference on Electronics, Information and Communication
  • (ICEIC 2016)
  • Danang, Vietnam, Jan. 27 - 30, 2016


2015

Journal Papers

  • A Performance-Energy Model to Evaluate Single Thread Execution Acceleration
  • Seung Hun Kim, Dohoon Kim, Changmin Lee, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Computer Architecture Letters, Vol.14, No.2, pp. 99-102, Dec. 2015
  • Dynamic Load Balancing of Parallel SURF with Vertical Partitioning
  • Deokho Kim, Minwoo Kim, Kyungah Kim, Minyong Sung, and Won Woo Ro
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 26, No. 12, pp. 3358-3370, Dec. 2015
  • Network Variation and Fault Tolerant Performance Acceleration in Mobile Devices with Simultaneous Remote Execution
  • Keunsoo Kim, Benjamin Y. Cho, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 64, No. 10, pp. 2862-2874, Oct. 2015
  • Highly Secure Mobile Devices Assisted with Trusted Cloud Computing Environments
  • Doohwan Oh, Ilkyu Kim, Keunsoo Kim, Sang-Min Lee, and Won Woo Ro
  • ETRI Journal, Vol. 37, No. 2, pp. 348-358, Apr. 2015

Conference Papers

  • True Motion Compensation With Feature Detection for Frame Rate Up-Conversion
  • Kyungah Kim, Minwoo Kim, Deokho Kim, and Won Woo Ro
  • The 2015 IEEE International Conference on Image Processing
  • (ICIP 2015)
  • Quebec City, Canada, Sep. 27 - 30, 2015
  • An Accelerated Separable Median Filter with Sorting Networks
  • Minsik Kim, Deokho Kim, Minyong Sung, and Won Woo Ro
  • The 2015 IEEE International Conference on Image Processing
  • (ICIP 2015)
  • Quebec City, Canada, Sep. 27 - 30, 2015
  • Contention-Free Fair Queuing for High-Speed Storage with RAID-0 Architecture
  • Myung Hyun Jo and Won Woo Ro
  • The 17TH IEEE International Conference on High Performance Computing and Communications
  • (HPCC 2015)
  • New York, USA, Aug. 24 - 26, 2015
  • Integrity Protection for Big Data Processing with Dynamic Redundancy Computation
  • Zhimin Gao, Nicholas DeSalvo, Pham Dang Khoa, Seung Hun Kim, Lei Xu, Won Woo Ro, Rakesh M. Verma,
    and Weidong Shi
  • The 2015 IEEE International Conference on Autonomic Computing
  • (ICAC 2015)
  • Grenoble, France, July 7 - 10, 2015
  • Improving Pipeline Utilization with Two-Level Instruction Issue on GPUs
  • Yunho Oh, Jong Hyun Park, and Won Woo Ro
  • The 30th International Techinical Conference on Circuits/Systems, Computers and Communicaions
  • (ITC-CSCC 2015)
  • Seoul, Korea, Jun. 29 - July 2, 2015
  • Accelerating ELMs on the GPU Toward Real-Time Training on Large Scale Data Sets
  • Han Kyul Kim, Jong Hyun Park, and Won Woo Ro
  • The 30th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2015)
  • Seoul, Korea, Jun. 29 - July 2, 2015
  • A Frequency Scaling Model for Energy Efficient DVFS Designs based on Circuit Delay Optimization
  • Ki Bum Chun, Changmin Lee and Won Woo Ro
  • The 19th IEEE International Symposium on Consumer Electronics
  • (ISCE 2015)
  • UPM, Madrid, Spain, Jun. 24 - 26, 2015
  • Another Look at Secure Big Data Processing: a Formal Framework and a Practical Approach
  • Lei Xu, Seung Hun Kim, Won Woo Ro, and Weidong Shi
  • The 8th IEEE International Conference on Cloud Computing
  • (Cloud'15, Application Track)
  • New York, USA, Jun. 27 - July 2, 2015
  • Enhancing Software Dependability and Security with Hardware Supported Instruction Address Space Randomization
  • Seung Hun Kim, Lei Xu, Ziyi Liu, Zhiqiang Lin, Won Woo Ro, and Weidong Shi
  • The 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks
  • (DSN 2015)
  • Rio de Janerio, Brazil, Jun. 22 - 25, 2015
  • Warped-Compression: Enabling Power Efficient GPUs through Register Compression
  • Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, and Murali Annavaram
  • The 42nd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2015)
  • Portland, OR, USA, Jun. 13 - 17, 2015
  • DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU
  • Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
  • The 2015 IEEE International Symposium on Performance Analysis of Systems and Software
  • (ISPASS 2015)
  • Philadelphia, PA, USA, Mar. 29 - 31, 2015



2014

Journal Papers

  • A Malicious Pattern Detection Engine for Embedded Security Systems in Internet of Things
  • Doohwan Oh, Deokho Kim, and Won Woo Ro
  • Sensors, Vol. 14, No. 12, pp. 24188-24211, Dec. 2014
  • C-Lock: Energy Efficient Synchronization for Embedded Multicore Systems
  • Seung Hun Kim, Sang Hyong Lee, Minje Jun, Byunghoon Lee, Won Woo Ro, Eui-Young Chung,
    and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 63, No. 8, pp. 1962-1974, Aug. 2014
  • Swarm Processor System: Hardware Process Scheduler based Energy Efficient Multi-Core System
  • Won Seob Jeong, Seung Hun Kim, Sang-Min Lee, and Won Woo Ro
  • IEICE Electronics Express, Vol. 11, No. 14, pp. 20140424, July 2014
  • Complexity-Effective Contention Management with Dynamic Backoff for Transactional Memory Systems
  • Seung Hun Kim, Dongmin Choi, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 63, No. 7, pp. 1696-1708, July 2014
  • Architectural Investigation of Matrix Data Layout on Multicore Processors
  • Minwoo Kim and Won Woo Ro
  • Future Generation Computer Systems, Vol. 37, pp. 64-75, July 2014
  • Exploiting Implementation Diversity and Partial Connection of Routers in Application-Specific Network-on-Chip Topology Synthesis
  • Minje Jun, Won Woo Ro, and Eui-Young Chung
  • IEEE Transactions on Computers, Vol. 63, No. 6, pp. 1434-1445, Jun. 2014
  • Accelerating MapReduce Framework on Multi-GPU Systems
  • Hai Jiang, Yi Chen, Zhi Qiao, Kuan-Ching Li, Won Woo Ro, and Jean-Luc Gaudiot
  • Cluster Computing, Vol. 17, No. 2, pp. 293-301, Jun. 2014
  • Boosting CUDA Applications with CPU-GPU Hybrid Computing
  • Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
  • International Journal of Parallel Programming, Vol. 42, No. 2, pp. 384-404, Apr. 2014
  • This is an extension of our INTERACT-16 paper which has been selected as one of the best papers and recommended to IJPP.

Conference Papers

  • LUT based Secure Cloud Computing - an Implementation using FPGAs
  • Lei Xu, Pham Dang Khoa, Seung Hun Kim, Won Woo Ro, and Weidong Shi
  • 2014 International Conference on ReConFigurable Computing and FPGAs
  • (ReConFig 2014)
  • Cancun, Mexico, Dec. 7 - 10, 2014
  • Workload Synthesis: Generating Benchmark Workloads from Statistical Execution Profile
  • Keunsoo Kim, Changmin Lee, Jung Ho Jung, and Won Woo Ro
  • IEEE International Symposium on Workload Characterization
  • (IISWC 2014)
  • Raleigh, North Carolina, USA, Oct. 26 - 28, 2014
  • Accelerating Gesture Recognition Algorithm Using Coarse Grained Reconfigurable Architectures
  • Minsik Kim, Deokho Kim, Minyong Sung, Wonjae Lee, Jaehyun Kim, and Won Woo Ro
  • The 4th International Conference on Audio, Language and Image Processing
  • (ICALIP 2014)
  • Shanghai, China, July 7 - 9, 2014
  • A Micro-benchmark Suite to Understand Micro-Architectural Differences between Processors
  • Changmin Lee, Keunsoo Kim, Jung Ho Jung, and Won Woo Ro
  • The 29th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2014)
  • Phuket, Thailand, July 1 - 4, 2014
  • Maximizing DRAM Performance using Selective Operating Frequency Boosting
  • Jung Ho Jung, Seung Hun Kim, Changmin Lee, and Won Woo Ro
  • The 18th International Symposium on Consumer Electronics
  • (ISCE 2014)
  • Jeju, Korea, Jun. 22 - 25, 2014
  • Workload and Variation Aware Thread Scheduling for Heterogeneous Multi-processor
  • Seungwon Lee and Won Woo Ro
  • The 18th International Symposium on Consumer Electronics
  • (ISCE 2014)
  • Jeju, Korea, Jun. 22 - 25, 2014
  • Best paper award, Bronze prize
  • DPM: Data Partitioning Method for Pipelined MapReduce on GPU
  • Myung Hyun Jo and Won Woo Ro
  • The 18th International Symposium on Consumer Electronics
  • (ISCE 2014)
  • Jeju, Korea, Jun. 22 - 25, 2014
  • Accelerating HEVC Transcoder by Exploiting Decoded Quadtree
  • Minyong Sung, Minwoo Kim, Minsik Kim, and Won Woo Ro
  • The 18th International Symposium on Consumer Electronics
  • (ISCE 2014)
  • Jeju, Korea, Jun. 22 - 25, 2014
  • Multicore Speedup Models using Frequency Scaling with Fixed Power Budget
  • Seungwon Lee, Seung Hun Kim, and Won Woo Ro
  • The 13th International Conference on Electronics, Information and Communication
  • (ICEIC 2014)
  • Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
  • Hyper Threading-aware Virtual Machine Migration
  • Chungmu Oh, and Won Woo Ro
  • The 13th International Conference on Electronics, Information and Communication
  • (ICEIC 2014)
  • Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
  • Development of Efficient VCPU Pinning Mechanism in Xen
  • Kyung Yoon Min, Seung Hun Kim, and Won Woo Ro
  • The 13th International Conference on Electronics, Information and Communication
  • (ICEIC 2014)
  • Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
< br>


2013

Journal Papers

  • Parallelized Sub-Resource Loading for Web Rendering Engine
  • Deokho Kim, Changmin Lee, Sangpil Lee, and Won Woo Ro
  • Journal of Systems Architecture, Vol. 59, No. 9, pp. 785-793, Oct. 2013
  • Design and Evaluation of Random Linear Network Coding Accelerators on FPGAs
  • Sunwoo Kim, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
  • ACM Transactions on Embedded Computing Systems, Vol.13, No. 1, pp. 1-24, Aug. 2013
  • GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table
  • Yunho Oh, Doohwan Oh, and Won Woo Ro
  • International Journal of Parallel Programming, Vol. 41, No. 4, pp. 526-551, Aug. 2013
  • A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems
  • Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
  • Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013
  • Exploiting SIMD Parallelism on Dynamically Partitioned Parallel Network Coding for P2P Systems
  • Deokho Kim, Karam Park, and Won Woo Ro
  • Computers & Electrical Engineering, Vol. 39, No. 1, pp. 55-56, Jan. 2013
  • Benefits of Using Parallelized Non-Progressive Network Coding
  • Minwoo Kim, Karam Park, and Won Woo Ro
  • Journal of Network and Computer Applications, Vol. 36, No. 1, pp. 293-305, Jan. 2013
  • Importance of Coherence Protocols with Network Applications on Multi-Core Processors
  • Kyueun Yi, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 62, No. 1, pp. 6-15, Jan. 2013

Conference Papers

  • Effcient Descriptor-Filtering Algorithm for Speeded Up Robust Features Matching
  • Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
  • The 5th FTRA International Conference on Computer Science and its Applications
  • (CSA-13)
  • Danang, Vietnam, Dec. 18 - 21, 2013
  • XSD: Accelerating MapReduce by Harnessing the GPU inside an SSD
  • Benjamin Y. Cho, Won Seob Jeong, Doohwan Oh, and Won Woo Ro
  • The 1st Workshop on Near-Data Processing. In conjunction with the MICRO-46
  • (WoNDP 2013)
  • Davis, USA, Dec. 8, 2013
  • Mark-Sharing: A Parallel Garbage Collection Algorithm for Low Synchronization Overhead
  • Hyunkyu Park, Changmin Lee, Seung Hun Kim, Won Woo Ro and Jean-Luc Gaudiot
  • The 19th IEEE International Conference on Parallel and Distributed Systems
  • (ICPADS 2013)
  • Seoul, Korea, Dec. 15 - 18, 2013
  • Leveraging Effectiveness of Contention Management for Transactional Memory Systems with Performance Monitoring
  • Keunsoo Kim, Seung Hun Kim, Sang-min Lee, and Won Woo Ro
  • The 28th International Technical Conference on Circuits/Systems, Computer and Communications
  • (ITC-CSCC 2013)
  • Yeosu, Korea, Jun. 30 - July 3, 2013
  • MGMR: Multi-GPU Based MapReduce
  • Yi Chen, Zhi Qiao, Hai Jiang, Kuan-Ching Li, Won Woo Ro
  • The 8th International Conference on Grid and Pervasive Computing
  • (GPC 2013)
  • Seoul, Korea, May. 9 - 11, 2013
  • Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism
  • Sangpil Lee and Won Woo Ro
  • The 2013 IEEE International Symposium on Performance Analysis of Systems and Software
  • (ISPASS 2013)
  • Austin, TX, USA, Apr. 21 - 23, 2013
  • Directory Centralized Ring-based Interconnection for Multi-Core Systems
  • Myung Kuk Yoon, Sangpil Lee, Deokho Kim, and Won Woo Ro
  • The 12th International Conference on Electronics, Information and Communication
  • (ICEIC 2013)
  • Bali, Indonesia, Jan. 30 - Feb. 2, 2013
  • Parallel Garbage Collection with Transactional Memory
  • Hyunkyu Park, Changmin Lee, and Won Woo Ro
  • The 12th International Conference on Electronics, Information and Communication
  • (ICEIC 2013)
  • Bali, Indonesia, Jan. 30 - Feb. 2, 2013



2012

Journal Papers

  • Multi-Threading and Suffix Grouping on Massive Multiple Pattern Matching Algorithm
  • Doohwan Oh and Won Woo Ro
  • The Computer Journal, Vol. 55, No. 11, pp. 1331-1346, Nov. 2012
  • Offloading of Media Transcoding for High-Quality Multimedia Services
  • Seung Hun Kim, Keunsoo Kim, Changmin Lee, and Won Woo Ro
  • IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, pp. 691-699, May 2012
  • Design of a Power-Efficient Parallel Pipelined Bloom Filter
  • Deokho Kim, Doohwan Oh, and Won Woo Ro
  • Electronics Letters, Vol. 48, No. 7, pp. 367-369, Mar. 2012
  • Reconfigurable and Parallelized Network Coding Decoder for VANETs
  • Sunwoo Kim and Won Woo Ro
  • Mobile Information Systems, Vol. 8, No. 1, pp. 45-59, Feb. 2012
  • Accelerated Network Coding with Dynamic Stream Decomposition on Graphics Processing Unit
  • Sangpil Lee and Won Woo Ro
  • The Computer Journal, Vol. 55, No. 1, pp. 21-34, Jan. 2012

Conference Papers

  • On Migration and Consolidation of VMs in Hybrid CPU-GPU Environments
  • Kuan-Ching Li, Keunsoo Kim, Won Woo Ro, Tien-Hsiung Weng, Che-Lun Hung, Chen-Hao Ku, Albert Cohen, and Jean-Luc Gaudiot
  • International Conference on Intelligent Technologies and Engineering Systems
  • (ICITES 2012) - LNEE
  • Changhua, Taiwan, Dec. 13-15, 2012
  • Conflict Avoidance Scheduling using Grouping List for Transactional Memory
  • Dongmin Choi, Seung Hun Kim, and Won Woo Ro
  • The 17th International Workshop on High-Level Parallel Programming Models and Supportive Environments
  • (HIPS-17)
  • Shanghai, China, May 21, 2012
  • Cooperative Heterogeneous Computing for Parallel Processing on CPU/GPU Hybrids
  • Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
  • The 16th Workshop on Interaction between Compilers and Computer Architectures
  • (INTERACT-16)
  • New Orleans, USA, Feb. 25 - 29, 2012
  • Matrix Data Layout Optimization for Multi-Core Architectures
  • Minwoo Kim, and Won Woo Ro
  • The 11th International Conference on Electronics, Information and Communication
  • (ICEIC 2012)
  • Jeongseon, Korea, Feb. 1 - 3, 2012
  • The Effect of Concurrency Control in Transactional Memory Systems
  • Seung Hun Kim, Dongmin Choi, and Won Woo Ro
  • The 11th International Conference on Electronics, Information and Communication
  • (ICEIC 2012)
  • Jeongseon, Korea, Feb. 1 - 3, 2012
  • Adaptive Replacement Cache in Transactional Memory
  • Dongmin Choi, Hyunkyu Park, Seung Hun Kim, and Won Woo Ro
  • The 11th International Conference on Electronics, Information and Communication
  • (ICEIC 2012)
  • Jeongseon, Korea, Feb. 1 - 3, 2012



2011

Journal Papers

  • A Novel Sequential Tree Algorithm Based on Scoreboard for MPI Broadcast Communication
  • Won-young Chung, Jae-won Park, Seung-Woo Lee, Won Woo Ro, and Yong-surk Lee
  • IEICE Transactions on Information and Systems, Vol 94, No. 12, pp. 2523-2527, December. 2011
  • Network Coding on Heterogeneous Multi-Core Processors for Wireless Sensor Networks
  • Deokho Kim, Karam Park, and Won W. Ro
  • Sensors, Vol 11, No. 8, pp. 7908-7933, Aug. 2011
  • A Low-Cost Standard Mode MPI Hardware Unit for Embedded MPSoC
  • Won-Young Chung, Ha-Young Jeong, Won W. Ro, and Yong-Surk Lee
  • IEICE Transactions on Information and Systems, Vol. E94-D, No.7, pp. 1497-1501, July 2011

Conference Papers

  • Parallel Transpose of Matrix Multiplication Based on the Tiling Algorithm
  • Minwoo Kim, Yong J. Jang, and Won W. Ro
  • The 54th IEEE International Midwest Symposium on Circuits and Systems
  • (MWSCAS 2011)
  • Seoul, Korea, Aug. 7 - 10, 2011
  • Performance Evaluation of Adaptive Progressive Network Coding
  • Deokho Kim, Karam Park, and Won W. Ro
  • The 54th IEEE International Midwest Symposium on Circuits and Systems
  • (MWSCAS 2011)
  • Seoul, Korea, Aug. 7 - 10, 2011



2010

Journal Papers

  • Multithreaded Pattern Matching Algorithm with Data Rearrangement
  • Doohwan Oh, Seung Hun Kim, and Won W. Ro
  • IEICE Electronics Express, Vol. 7, No. 20, pp. 1520-1526, Oct. 2010
  • On Improving Parallelized Network Coding with Dynamic Partitioning
  • Karam Park, Joon-Sang Park, and Won W. Ro
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 11, pp. 1547-1560, Nov. 2010
  • Hardware Implementation of a Tessellation Accelerator for the OpenVG Standard
  • Seung Hun Kim, Yunho Oh, Karam Park, and Won W. Ro
  • IEICE Electronics Express, Vol. 7, No. 6, pp. 440-446, Mar. 2010

Conference Papers

  • Development of Virtual CUDA Systems of Parallel Processing on CPU and GPGPU
  • Doohwan Oh, Sangpil Lee, Deokho Kim, Changmin Lee, and Won W. Ro
  • Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds In Conjunction with MICRO 2010
  • (MASVDC Workshop 2010)
  • Atlanta, USA, Dec. 5, 2010
  • Implementing FFT using SPMD style of OpenMP
  • Tien-Hsiung Weng, Sheng-Wei Huang, Won Woo Ro, and Kuan-Ching Li
  • In Proc. of the 6th International Conference on Networked Computing and Advanced Information Management
  • (NCM 2010)
  • Seoul, Korea, Aug. 16 - 18, 2010
  • Multi-Threaded Filtered BackProjection Algorithm on Multi-Core Processors
  • Yun H. Oh and Won W. Ro
  • The 10th International Conference on Electronics, Information, and Communication
  • (ICEIC 2010)
  • Cebu, Philippines, Jun. 30 - July 2, 2010
  • Accelerated Reconstruction Using Parallel Computing for Spiral Spectroscopic Imaging
  • Dong H. Kim, Yun H. Oh, Yun H. Nam, M. Gu, and Won W. Ro
  • In Proc. of 2010 International Society for Magnetic Resonance in Medicine Annual Meeting
  • (2010 ISMRM Annual Meeting)
  • Stockholm, Sweden, May 1 - 7, 2010
  • FPGA Implementation of Highly Parallelized Decoder Logic for Network Coding
  • Sunwoo Kim and Won W. Ro
  • In Proc. of Eighteenth ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
  • (FPGA 2010)
  • Monterey, USA, Feb. 21 - 23, 2010



2009

Journal Papers

  • A Complexity-Effective Microprocessor Design with Decoupled Dispatch Queues and Prefetching
  • Won W. Ro and Jean-Luc Gaudiot
  • Parallel Computing, Vol. 35, No. 5, pp. 255-268, May 2009

Conference Papers

  • Evaluation of Cache Coherence Protocols on Multi-Core Systems with Linear Workloads
  • Yong J. Jang and Won W. Ro
  • In Proc. of 2009 International Colloquium on Computing, Communication, Control, and Management
  • (CCCM 2009)
  • Sanya, China, Aug. 8 - 9, 2009
  • Comparing Open Source Web Services: gSoap and AXIS
  • Jongwook Woo and Won W. Ro
  • In Proc. of the 24th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2009)
  • Jeju Island, Korea, July 5 - 8, 2009
  • Efficient Parallelized Network Coding for P2P File Sharing Applications
  • Karam Park, Joon-Sang Park, and Won W. Ro
  • In Proc. of the 4th International Conference on Grid and Pervasive Computing
  • (GPC 2009)
  • Geneva, Switcherland, May 4 - 8, 2009
  • Fully Pipelined Hardware Implementation of 128-bit SEED Block Cipher Algorithm
  • Jaeyoung Yi, Karam Park, Joonseok Park, and Won W. Ro
  • In Proc. of the 5th International Workshop on Applied Reconfigurable Computing
  • (ARC 2009)
  • Karlsruhe, Germany, Mar. 16 - 18, 2009

Book Chapters

  • Programmability and Scalability on Multi-Core Architectures
  • Jaeyoung Yi, Yong J. Jang, Doohwan Oh, and Won W. Ro
  • Chapter in "Handbook of Research on Scalable Computing Technologies", edited by Kuan-Ching Li, Ching-Hsien Hsu, Laurence Tianruo Yang, Jack Dongarra, and Hans Zima, Information Science Reference, 2009



2008

Journal Papers

  • Efficient Peer-to-Peer File Sharing Using Network Coding in MANET
  • Uichin Lee, Joon-Sang Park, Seung-Hoon Lee, Won W. Ro, Giovanni Pau, and Mario Gerla
  • Journal of Communications and Networks, Vol. 10, No. 4, Dec. 2008
  • A Low-Complexity Microprocessor Design with Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • Journal of Systems Architecture, Vol. 54, No. 12, pp. 1101-1112, Dec. 2008
  • Performance Evaluation of Programming Models for SMP-Based Clusters
  • Myungho Lee, Neungsoo Park, Won W. Ro, and Kuan-Ching Li
  • Journal of the Chinese Institute of Engineers, Vol. 31, No. 7, pp. 1181-1188, Dec. 2008
  • Simultaneous Thin-Thread Processors for Low-Power Embedded Systems
  • Won W. Ro, Jaeyoung Yi, Joon-Sang Park, and Joonseok Park
  • IEICE Electronics Express, Vol. 5, No. 19, pp. 802-808, Oct. 2008
  • Delay Analysis of Car-to-Car Reliable Data Delivery Strategies Based on Data Mulling with Network Coding
  • Joon-Sang Park, Uichin Lee, Soon Young Oh, Mario Gerla, Desmond Siumen Lun, Won W. Ro, and Joonseok Park
  • IEICE Transactions on Information and Systems, Vol. E91-D, No. 10, Oct. 2008

Conference Papers

  • Parallel Algorithms for Steiner Tree Problem
  • Joon-Sang Park, Won W. Ro, Handuck Lee, and Neungsoo Park
  • In Proc. of the 3rd International Conference on Convergence and Hybrid Information Technology
  • (ICHIT 2008)
  • Busan, Korea, Nov. 11 - 13, 2008



2006

Journal Papers

  • Design and Evaluation of a Hierarchical Decoupled Architecture
  • Won W. Ro, Stephen P. Crago, Alvin M. Despain, and Jean-Luc Gaudiot
  • Journal of Supercomputing, Springer, Vol. 38, No. 3, pp. 237-259, Dec. 2006
  • Speculative Pre-Execution Assisted by Compiler (SPEAR)
  • Won W. Ro and Jean-Luc Gaudiot
  • Journal of Parallel and Distributed Computing, Elsevier, Vol. 66, No. 8, pp. 1076-1089, Aug. 2006

Conference Papers

  • Design and Effectiveness of Small-Sized Decoupled Dispatch Queues
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of European Conference on Parallel Computing - LNCS
  • (EURO-PAR 2006)
  • Dresden, Germany, Aug. 29 - Sep. 1, 2006


2005

Conference Papers

  • A Low-Complexity Issue Queue Design with Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 12th International Conference on High Performance Computing
  • (HiPC 2005)
  • Goa, India, Dec. 18 - 21, 2005

Book Chapters

  • Techniques to Improve Performance Beyond Pipelining: Superpipelining, Superscalar, and VLIW
  • Jean-Luc Gaudiot, Jung-Yup Kang, and Won Woo Ro
  • Chapter in "Computer Architecture", a volume of "Advance in Computers", edited by Ali R.Hurson, Elsevier, 2005


2004

Conference Papers

  • SPEAR: A Hybrid Model for Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 18th International Parallel and Distributed Processing Symposium
  • (IPDPS 2004)
  • Santa Fe, New Mexico, 2004


2003

Conference Papers

  • HiDISC: A Decoupled Architecture for Data-Intensive Applications
  • Won W. Ro, Jean-Luc Gaudiot, Stephen P. Crago, and Alvin M. Despain
  • In Proc. of the 17th International Parallel and Distributed Processing Symposium
  • (IPDPS 2003)
  • Nice, France, Apr. 22 - 26, 2003
  • Compiler Support for Dynamic Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 7th Annual Workshop on Interaction between Compilers and Computer Architectures
  • (INTERACT-7) in conjunction with HPCA-9
  • Anaheim, California, Feb. 8, 2003


2000

Conference Papers

  • Memory Latency: to Tolerate or to Reduce?
  • Amol Bakshi, Jean-Luc Gaudiot, Wen-Yen Lin, Manil Makhija, Viktor K. Prasanna, Wonwoo Ro, and Chulho Shin
  • In Proc. of the 12th Symposium on Computer Architecture and High Performance Computing
  • (SBAC-PAD'00)
  • Sao Pedro, Brazil, Oct. 24 - 27, 2000
  • A High-Performance, Hierarchical Decoupled Architecture
  • Stephen P. Crago, Alvin Despain, Jean-Luc Gaudiot, Manil Makhija, Wonwoo Ro, and Apoorv Srivastava
  • In Proc. of the Memory access Decoupling for superscalar and multiple issue Architectures
  • (MEDEA) Workshop in conjunction with PACT 2000
  • Philadelphia, Oct. 15, 2000
  • A Reliable Cluster Computing with a New Checkpointing RAID-x Architecture
  • Kai Hwang, Hai Jin, Roy Ho, and Wonwoo Ro
  • In Proc. of the 9th Heterogeneous Computing Workshop
  • (HCW)
  • Cancun, Mexico, May 1, 2000