Programming Languages,
Architecture and
Compilers
Education Laboratory
Publications
Reducing Write Barrier Overheads for Orthogonal Persistence Yilin Zhang, Omkar Dilip Dhawal, V. Krishna Nandivada, Shigeru Chiba, Tomoharu Ugawa SLE, October 20-21, 2024, Pasadena, California, USA
The ART of Sharing Points-to Analysis: Reusing Points-to Analysis Results Safely and Efficiently Shashin Halalingaiah, Vijay Sundaresan, Daryl Maier, V. Krishna Nandivada OOPSLA, October 20-25, 2024, Pasadena, California, USA
Automated Robustness Verification of Concurrent Data Structure Libraries Against Relaxed Memory Models Kartik Nagar, Anmol Sahoo, Romit Roy Chowdhary, Suresh Jagannathan OOPSLA, October 20-25, 2024, Pasadena, California, USA
Selective Memory Compression for GPU Memory Oversubscription Management Abdun Nihaal, Madhu Mutyam International Conference on Parallel Processing (ICPP), August 12-15, 2024, Gotland, Sweden
Cache Line Pinning for Mitigating Row Hammer Attack Praseetha M, Venkata Kalyan Tavva, Madhu Mutyam International Conference on Parallel Processing (ICPP), August 12-15, 2024, Gotland, Sweden
FlexiGran: Flexible Granularity Locking in Hierarchies Anju M A, Rupesh Nasre International European Conference on Parallel and Distributed Computing (EuroPar), August 26-30, 2024, Madrid, Spain
Homeostasis: Design and Implementation of a Self-Stabilizing Compiler Aman Nougrahiya, V. Krishna Nandivada ACM Transactions on Programming Languages and Systems (TOPLAS), 2024
COWS for High Performance: Cost Aware Work Stealing for Irregular Parallel Loop Prasoon Mishra, V. Krishna Nandivada ACM Transactions on Architecture and Code Optimization (TACO), 2024
UWOmppro: UWOmp++ with Point-to-Point Synchronization, Reduction and Schedules Aditya Agrawal, V. Krishna Nandivada International Conference on Parallel Architectures and Compilation Techniques (PACT), October 21-25, 2023, Vienna, Austria
Single-linkage clustering of dynamic data Anju M A, Rupesh Nasre Concurrency and Computation: Practice and Experience, 2023
Scaling the Maximum Flow Computation on GPUs Jash Khatri, Arihant Samar, Bikash Behera, Rupesh Nasre International Journal of Parallel Programming, 2022
Multi Interval DomLock (MID): Towards Improving Concurrency in Hierarchies Anju M A, Rupesh Nasre ACM Transactions on Parallel Computing (TOPC), 2022
Colosseum: Regression Test Prioritization by Delta Displacement in Test Coverage Shouvick Mondal, Rupesh Nasre IEEE Transactions on Software Engineering, 2022
Certified Mergeable Replicated Data Types Vimala Soundarapandian, Adharsh Kamath, Kartik Nagar, KC Sivaramakrishnan ACM Conference on Programming Language Design and Implementation (PLDI), June 13-17, 2022, San Diego, USA
Hansie: Hybrid and Consensus Regression Test Prioritization Shouvick Mondal, Rupesh Nasre Journal of Systems and Software (JSS), 2020 and IEEE International Conference on Software Testing, Verification and Validation (ICST), 2021 (under the journal-first track)
Router Buffer Caching for Managing Shared Cache Blocks in Tiled Multi-Core Processors Joe Augustine, Raghavendra Kanakagiri, John Jose, Madhu Mutyam IEEE International Conference on Computer Design (ICCD), 2020
DisGCo: A Compiler for Distributed Graph Analytics Anchu Rajendran, V. Krishna Nandivada ACM Transactions on Architecture and Code Optimization (TACO), 2020
Optimization of Inter Cache Traffic Entanglement in Tagless Caches with Tiling Opportunities S.R. Swamy Saranam Chongala, Sumitha George, Hariram Thirucherai Govind, Jagadish Kotra, Madhu Mutyam, John Samson, Mahmut Kandemir, Vijaykrishnan Narayanan International Conference on Compilers, Architectures, and Synthesis for Embedded Systems (CASES), September 20-25, 2020, Hamburg, Germany
Fuzzy Fairness Controller for NVMe SSDs Shivani Tripathy, Debiprasanna Sahoo, Manoranjan Satpathy, Madhu Mutyam ACM International Conference on Supercomputing (ICS), June 29-July 2, 2020
Graffix: Efficient Graph Processing with a Tinge of GPU-specific Approximations Somesh Singh, Rupesh Nasre International Conference on Parallel Processing (ICPP), Aug 17-20, 2020, Edmonton, Canada
Semantics, Specification and Bounded Verification of Concurrent Libraries in Replicated Systems Kartik Nagar, Prasita Mukherjee, Suresh Jagannathan International Conference on Computer Aided Verification (CAV), July 19-24, 2020, Los Angeles, California, USA
Orion: Time Estimated Causally Consistent Key-Value Store Diptanshu Kakwani, Rupesh Nasre Workshop on Principles and Practice of Consistency for Distributed Data (PaPoC), April 27, 2020, Heraklion, Crete, Greece
OPAL: An extensible framework for ontology-based program analysis Dileep Kumar Pattipati, Rupesh Nasre, P. Sreenivasa Kumar Software: Practice and Experience (SPE), 2020
High-Speed, Two-Dimensional Digital Image Correlation Algorithm Using Heterogeneous (CPU-GPU) Framework Mullai Thiagu, Shankar Subramanian, Rupesh Nasre Strain, an International Journal of Experimental Mechanics, 2020
A Scalable and Energy-efficient Concurrent Binary Search Tree with Fatnodes Praveen Alapati, Venkata Kalyan Tavva, Madhu Mutyam IEEE Transactions on Sustainable Computing (T-SUSC), 2020
Mix Your Contexts Well: Opportunities Unleashed by Recent Advances in Scaling Context-Sensitivity Manas Thakur, V Krishna Nandivada International Conference on Compiler Construction (CC), Feb 22-23, 2020, San Diego, USA
A Study of Graph Analytics for Massive Datasets on Large-Scale Distributed GPUs V Jatala, R Dathathri, G Gill, L Hoang, V Krishna Nandivada and K Pingali To appear in the IEEE International Parallel and Distributed Processing Symposium (IPDPS), May 18-22, 2020, New Orleans, USA
On the fly MHP Analysis Sonali Saha, V Krsihna Nandivada To appear in the International Conference on Principles and Practice of Parallel Programming (PPoPP), Feb 22-26, 2020, San Diego, USA
SixTrack V and runtime environment R. De Maria , J. Andersson , V. K. Berglyd Olsen , L. Field , M. Giovannozzi , P. D. Hermes , N. Høimyr , S. Kostoglou , G. Iadarola , E. Mcintosh , A. Mereghetti , J. Molson , D. Pellegrini , T. Persson , M. Schwinzerl , E. H. Maclean , K. N. Sjobak , I. Zacharov and Somesh Singh International Journal of Modern Physics A, 2019
Batch alias analysis Jyothi Vedurada, V Krishna Nandivada International Conference on Automated Software Engineering (ASE), Nov 11-15, 2019, San Diego, USA
Mahtab: Phase-wise Acceleration of Regression Testing for C Shouvick Mondal, Rupesh Nasre Journal of Systems and Software, 2019
Concurrent Treaps and Impact of Locking Objects Praveen Alapati, Madhu Mutyam New Generation Computing Journal, 2019
Endurance Enhancement of Write-Optimized STT-RAM Caches Puneet Saraf, Madhu Mutyam International Symposium on Memory Systems (MemSys), Sept 30 - Oct 03, 2019, Washington, DC, USA
Precise, Efficient and Secure Just-In-Time Analysis of Java Programs Manas Thakur, V Krishna Nandivada Poster in European Conference on Object Oriented Programming (ECOOP), July 15-19, 2019, Hammersmith, London, UK
Optimizing Remote Communication in X10 Arun T, V Krishna Nandivada ACM Transactions on Architecture and Code Optimization (TACO), 2019
Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous Graph Analytics R Dathathri, G Gill, L Hoang, H Dang, V Jatala, V K Nandivada, M Snir and K Pingali International Conference on Parallel Architectures and Compilation Techniques (PACT), September 23-26, 2019, Seattle, USA
Variable Sized Cache Block Compaction Sayantan Ray, Madhu Mutyam International Conference on Parallel Architectures and Compilation Techniques (PACT), September 23-26, 2019, Seattle, USA
Efficient Lock-Step Synchronization in Task-Parallel Languages Akshay Utture, V Krishna Nandivada Software Practice and Experience (SPE), 2019
Graph Coloring using GPUs Meghana Sistla, V Krishna Nandivada European Conference on Parallel and Distributed Computing (Euro-Par), August 26-30, 2019, Gottingen, Germany
Contention-aware Task Scheduler for Concurrent Hierarchical Operations Saurabh Kalikar, Rupesh Nasre European Conference on Parallel and Distributed Computing (Euro-Par), August 26-30, 2019, Gottingen, Germany
Regression Event Selection for Android Applications Aman Sharma, Rupesh Nasre International Syposium on Software Testing and Analysis (ISSTA), July, 2019, Beijing, China
PYE: A Framework for Precise-Yet-Efficient Just-In-Time Analyses for Java Programs Manas Thakur, V Krishna Nandivada ACM Transactions on Programming Languages and Systems (TOPLAS), 2019
Compare Less, Defer More: Scaling Value-Contexts Based Whole-Program Heap Analyses Manas Thakur, V Krishna Nandivada International Conference on Compiler Construction (CC), Feb 16-17, 2019, Washington, DC, USA
Efficiency and Expressiveness in UW-OpenMP Raghesh Aloor, V Krishna Nandivada International Conference on Compiler Construction (CC), Feb 16-17, 2019, Washington, DC, USA
SixTrack Project: Status, Runtime Environment and New Developments R. De Maria, J. Andersson, L. Field, M. Giovannozzi, P.D. Hermes, N. Hoimyr, G. Iadarola, S. Kostoglou, E.H. Maclean, E. McIntosh, A. Mereghetti, J. Molson, V.K.B. Olsen, D. Pellegrini, T. Persson, M. Schwinzerl, K.N. Sjobak (CERN), E.H. Maclean (University of Malta), Somesh Singh, K.N. Sjobak (University of Oslo) and I. Zacharov (EPFL) ICAP, 2018
ReDRAM: A Reconfigurable DRAM Cache for GPGPUs Debiprasanna Sahoo, Swaraj Sha, Manoranjan Satpathy, Madhu Mutyam IEEE Computer Architecture Letters, 2018
Formal Modeling and Verification of Controllers for a Family of DRAM Caches Debiprasanna Sahoo, Swaraj Sha, Manoranjan Satpathy, Madhu Mutyam, Ramesh S, and Partha Roop International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS), Sept 30 - Oct 5, 2018, Torino, Italy
Identifying Refactoring Opportunities for Replacing Type Code with Subclass and State Jyothi Vedurada, V Krishna Nandivada PACMPL (OOPSLA), Nov 7-9, 2018, Boston, US
Pyramid-based Initial Guess Estimation for 2D-DIC Algorithm realized using Heterogeneous (CPU-GPU) Framework Mullai Thiagu, Sankara Subramanian and Rupesh Nasre iDICS conference, Oct 15-18, 2018, China
Optimizing Graph Algorithms in Asymmetric Multicore Processors Jyothi Krishna V S and Rupesh Nasre Special issue of TCAD, ACM SIGBED International Conference on Embedded Software (EMSOFT), Sept 30 - Oct 5, 2018, Italy
Optimizing Remote Data Transfers in X10 T Arun, V Krishna Nandivada International Conference on Parallel Architectures and Compilation Techniques, November 1-4, 2018, Limassol, Cyprus
Towards Optimal Multi-Granularity Locking in Hierarchies Saurabh Kalikar, Rupesh Nasre International Conference on Parallel Processing, August 13-16, 2018, Oregon, US
Multi-Granularity Locking in Hierarchies with Synergistic Hierarchical and Fine-Grained Locks Ganesh K, Saurabh Kalikar and Rupesh Nasre EURO-PAR, August 27-31, 2018, Turin, Italy
TDC:Tagless DRAM Cache S R Swamy Saranam Chongala, Madhu Mutyam IEEE Computer Society Annual Symposium on VLSI, July 9-11, 2018, Hong Kong SAR, China
Finding Shortest Triangular Path and its Family inside a Digital Object Apurba Sarkar, Arindam Biswas, Mousumi Dutt, and Shouvick Mondal Fundamenta Informaticae, 2018
TTLG: An Efficient Tensor Transposition Library for GPUs Jyothi Vedurada, Arjun Suresh, Aravind Sukumaran Rajam, Jinsung Kim, Changwan Hong, Sriram Krishnamoorthy, V. Krishna Nandivada, Ajay Panyala, Rohit Srivastava and P Sadayappan 32nd IEEE International Parallel & Distributed Processing Symposium (IPDPS), May 21–25, 2018, Vancouver, British Columbia, Canada
Scalable and Performant Graph Processing on GPUs using Approximate Computing Somesh Singh and Rupesh Nasre IEEE Transactions on Multiscale Computing Systems (TMSCS), 2018
CHOAMP: Cost Based Hardware Optimization for Asymmetric Multicore Processors Jyothi Krishna V S, Shankar Balachandran and Rupesh Nasre IEEE Transactions on Multiscale Computing Systems (TMSCS), 2018
MBZip: Multi-Block Data Compression Raghavendra K, Biswabandan Panda and Madhu Mutyam ACM Transactions on Architecture and Code Optimization (TACO), 2017
RCTP: Region Correlated Temporal Prefetcher Dennis Antony Varkey, Biswabandan Panda and Madhu Mutyam 35th IEEE International Conference on Computer Design (ICCD), November 5-8, 2017, Boston, Massachusetts, USA
Energy Efficient Compilation of Irregular Task-Parallel Loops Rahul Shrivastava and V Krishna Nandivada ACM Transactions on Architecture and Code Optimization (TACO), 2017
DCEIL: Distributed Community Detection with the CEIL Score Akash Jain, Rupesh Nasre, and Ravindran Balaraman 18th IEEE Conference on High Performance Computing and Communications (HPCC), December 18-20, 2017, Bangkok, Thailand
FatCBST: Concurrent Binary Search Tree with Fatnodes Praveen Alapati, Venkata Kalyan Tavva and Madhu Mutyam 18th IEEE Conference on High Performance Computing and Communications (HPCC), December 18-20, 2017, Bangkok, Thailand
DomLock: A New Multi-Granularity Locking Technique for Hierarchies Saurabh Kalikar and Rupesh Nasre ACM Transactions on Parallel Computing, 2017
Concurrent Treaps Praveen Alapati, S R Swamy Saranam and Madhu Mutyam To appear in ICA3PP UCER 2017, August 21-23, 2017, Helsinki, Finland
Optimizing Recursive Task Parallel Programs Suyash Gupta, Rahul Shrivastava and V Krishna Nandivada International Conference on Supercomputing, June 14-16, 2017, Chicago, USA
Identifying Use-After-Free Variables in Fire-and-Forget Tasks Jyothi Krishna V S and Vassily Litvinov Chapel Implementers and Users Workshop, June 2-3, 2017, Orlando, Florida, USA
Automatic Code Generation for Graph Algorithms on GPUs Shashidhar G and Rupesh Nasre International Workshop on Languages and Compilers for Parallel Computing (LCPC), September 28-30, 2016, Rochester, NY, USA
EagerMerge: An Optimistic Technique for Efficient Points-to Analysis Sudhir Samrit and Rupesh Nasre International Symposium on Software Testing and Analysis (ISSTA), July 18-20, 2016, Germany
Improved MHP Analysis Aravind Sankar, Soham Chakraborty, V Krishna Nandivada International Conference on Compiler Construction (CC), March 17-18, 2016, Barceona, Spain
CASCADE: Congestion Aware Switchable Cycle Adaptive Deflection Router Gnaneswara Rao Jonna, Vamana Murthi Thuniki and Madhu Mutyam International Conference on Architecture of Computing Systems (ARCS), April 4-7, 2016, Nuremberg, Germany
GPU Centric Extensions for Parallel Strongly Connected Components Computation Shrinivas Devshatwar, Madhur Amilkanthwar and Rupesh Nasre GPGPU 2016, March 12, 2016, Barcelona, Spain
DomLock: A New Multi-Granularity Locking Technique for Hierarchies Saurabh Kalikar and Rupesh Nasre Principles and Practice of Parallel Programming (PPoPP), March 12-16, 2016, Barcelona, Spain
PBC: Prefetched Block Compaction K. Raghavendra, Biswabandan Panda and Madhu Mutyam IEEE Transactions on Computers, 2015
MBZip: A Case for Compressing Multiple Data Blocks K. Raghavendra, Biswabandan Panda and Madhu Mutyam International Conference on Parallel Architectures and Compilation Techniques (PACT), October 18-21, 2015, San Franciso
Efficient online cycle detection technique combining with Steensgaard points-to information Fei Lui, Bixin Li and Rupesh Nasre Journal of Software: Practice and Experience, 2015
SkipCache: Application Aware Cache Management for Chip Multi-Processors Tripti S. Warrier, K. Raghavendra and Madhu Mutyam IET Computers & Digital Techniques(CDT), pp. 293-299, 2015
Unique Worker model for OpenMP Raghesh Aloor and V Krishna Nandivada International conference on Supercomputing (ICS), pp. 47-56, June 8-10, 2015, Newport Beach, CA, USA
Patent: Systems and methods for automatically optimizing high performance computing programming languages (US patent no: 8924946) Ganesh Bikshandi, V Krishna Nandivada, Igor Peshansky and Vijay Saraswat
Loop tiling in the presence of exceptions Abhilash Bhandari and V Krishna Nandivada European Conference on Object-Oriented Programming (ECOOP), pp. 124-128, July 5-10, 2015, Prague, Czech Republic
Lexical State Analyzer for JavaCC grammars K Gupta and V K Nandivada Software: Practice and Experience, 2015
EFGR: An Enhanced Fine Grain Granularity Refresh Feature for High Performance DDR4 DRAM Devices T.V. Kalyan, K. Ravi, and Madhu Mutyam 10th HiPEAC Conference, January 19-21, 2015, Amsterdam, Netherlands
IMSuite: A Benchmark Suite for Simulating Distributed Algorithms Suyash Gupta and V Krishna Nandivada Journal of Parallel and Distributed Computing, 2013
SFFMap: Set-First Fill Mapping for an Energy Efficient Pipelined Date Cache Pritam Majumder, Venkata Kalyan T and Madhu Mutyam IEEE International Conference on Computer Design (ICCD), pp. 104-109, 2014
EFGR: An Enhanced Fine Granularity Refresh Feature for High Performance DDR4 DRAM Devices Venkata Kalyan T, Ravi Kasha and Madhu Mutyam ACM Transactions on Architecture and Code Optimization (TACO), pp. 31:1-31:26, 2014
Implementation and Analysis of History Based Output Channel Selection Function for Adaptive Routers in Mesh NoCs John Jose and Madhu Mutyam ACM Transactions on Design Automation of Electronic Systems (TODAES), pp. 35:1-35:22, 2014
Auto-Parallelization of Data Structure Operations for GPUs Rupesh Nasre International Conference on Compilers, Architectures and Synthesis of Embedded Systems (CASES), pp. 7:1-7:10, October 12-17, 2014, New Delhi, India
Data Remapping for an Energy Efficient Burst Chop in DRAM Memory Systems Sudharsan J, Venkata Kalyan T, and Madhu Mutyam ACM Student Research Competition (SRC), International Conference on Parallel Architectures and Compilation Techniques (PACT), August 24-27, 2014, Edmonton, Canada
Using Packet Information for Efficient Communication in NoCs Prasanna Venkatesh and Madhu Mutyam International Symposium on Networks-on-Chip (NOCS), pp. 43-150, September 17-19, 2014, Ferrera, Italia
Improving Fairness in Memory Scheduling Using a Team of Learning Automata Aditya Kajwe and Madhu Mutyam The Memory Forum, Co-located with ISCA, June 14, 2014, Minneapolis, Minnesota.
Push-Pull Constraint Graph for Efficient Points-to Analysis Bollu Ratnakar and Rupesh Nasre International Symposium on Memory Management (ISMM), pp. 25-33, June 12, 2014, Edinburgh, Scotland, UK.
Patent: Model, System and Program Storage Device for automatic incremental learning of Programming language grammar (US patent no : 8676826) V Krishna Nandivada, Pankaj Dhoolia, Mangala Gowri, and Diptikalyan Saha
Patent: System and Method for Dynamic Code Analysis in presence of the "table processing" idiom (US patent no : 8583965) V Krishna Nandivada, Pankaj Dhoolia, Mangala Gowri, and Diptikalyan Saha
Patent: Method, System and Program Storage Device that Provide for Automatic Programming Language Grammar Partitioning (US patent no : 8516457) V Krishna Nandivada, Pankaj Dhoolia, Mangala Gowri, and Diptikalyan Saha
SAMO: Store Aware Memory Optimizations. Raghavendra, Tripti Warrier, and Madhu Mutyam ACM International Conference on Computing Frontiers (CF), pp. 33:1-33:10, May 20-22, 2014, Cagliari, Italy
Improved Bitwidth-Aware Variable Packing. V Krishna Nandivada, Rajkishore Barik High Performance and Embedded Architecture and Compilation conference (HiPEAC), Jan 20-22, 2014, Vienna, Austria.
Minimally Bufferred Single-Cycle Deflection Router for Mesh NoCs. Gnaneswara Rao Jonna, John Jose, Rachana Radhakrishnan, Madhu Mutyam International Conference on Design, Automation & Test in Europe (DATE), Mar 24-28, 2014, Dresden, Germany (accepted as a poster).
Scattered Refresh: An Alternative Refresh Mechanism to Reduce Refresh Cycle Time. T.V. Kalyan, K. Ravi, Madhu Mutyam 19th Asia and South Pacific Design Automation Conference (ASP-DAC), pp. 598-603, Jan 20-23, 2014, Singapore.
SLIDER: Smart Late Injection DEflection Router for Mesh NoCs Bhawna Nayak, John Jose, Madhu Mutyam 31st IEEE International Conference on Computer Design (ICCD), pp. 377-383, Oct 6-9, 2013, Asheville, NC, USA.
Improved Bitwidth-Aware Variable Packing V K Nandivada, R Barik ACM Transactions on Architecture and Code Optimization (TACO), pp. 16:1-16:22, 2013
A Transformation Framework for Optimizing Task-Parallel Programs. V. Krishna Nandivada, Jun Shirako, Jisheng Zhao, Vivek Sarkar ACM Trans. Program. Lang. Syst. 35(1) 3, 2013
An Application-Aware Cache Replacement Policy for Last-Level Caches Tripti Warrier, B. Anupama, Madhu Mutyam ARCS 2013, pp. 207-219, 2013
DeBAR: deflection based adaptive router with minimal buffering John Jose, Bhawna Nayak, Kranthi Kumar, Madhu Mutyam DATE 2013, pp. 1583-1588, 2013
Fibonacci Codes for Crosstalk Avoidance Madhu Mutyam IEEE Trans. VLSI Syst. 20(10), pp. 1899-1903, 2012
TRACKER: A low overhead adaptive NoC router with load balancing selection strategy John Jose, K. V. Mahathi, J. Shiva Shankar, Madhu Mutyam ICCAD, pp. 564-568, 2012
Way Sharing Set Associative Cache Architecture C. J. Janraj, T. Venkata Kalyan, Tripti Warrier, Madhu Mutyam VLSI Design, pp. 251-256, 2012
Identifying Services from Business Applications R Komondoor, V K Nandivada, S Sinha, John Field India Software Engineering Conference, 2012