Publications

Conference Papers

Journal Papers

Technical Memos

Technical Notes

Theses

 

 

 

 



Conference Papers

"A Parallel Dynamic Programming Algorithm on a Multi-core Architecture"
The 19th Annual ACM Symposium on Parallelism in Algorithms and Architectures (SPAA '07), San Diego, CA, USA, June 9-11, 2007
Guangming Tan, Ninghui Sun, and Guang R. Gao

"Synchronization State Buffer: Supporting Efficient Fine-Grain Synchronization for Many-Core Architectures"
The 34th International Symposium on Computer Architecture (ISCA2007), San Diego, CA, USA, June 9-13, 2007
Weirong Zhu, Vugranam C. Sreedhar, Ziang Hu, and Guang R. Gao
Available in pdf format

"Exploring Financial Applications on Many-core-on-a-chip Architecture: A First Experiment"
Workshop on Frontiers of High Performance Computing and Networking (FHPCN2006), in Proceedings of 4th International Symposium on Parallel and Distributed Processing and Applications (ISPA-06) , Sorrento, Italy, Dec.4-7, 2006; (Lecture Notes in Computer Science, Vol. 4331, pp.221-230, 2006)
Weirong Zhu, Parimala Thulasiraman, Ruppa K. Thulasiram and Guang R. Gao
Available in pdf format

"On the Role of Deterministic Fine Grain Data Synchronization for Scientific Applications: A Revisit in the Emerging Many-Core Era"
First Workshop on Multithreaded Architecures and Applications in the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), March 26th - 30th, 2007, Long Beach, California USA
Weirong Zhu, Ziang Hu, and Guang Gao

"Exploring a multithreaded Methodology to Implement a Network Communication Protocol on the Cyclops-64 Multithreaded Architecture"
First Workshop on Multithreaded Architectures and Applications in the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), March 26th - 30th, 2007, Long Beach, California USA
Ge Gan, Ziang Hu, Juan del Cuvillo, and Guang Gao
Also available in pdf format

"Experience of Optimizing FFT on Intel Core Architecture"
Workshop on Performance Optimization for High-Level Languages and Libraries in the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), March 26th - 30th, 2007, Long Beach, California USA
Daniel Orozco, Liping Xue, Murat Bolat, Xiaoming Li and Guang Gao
Also available in pdf format

"Automatic Program Segment Similarity Detection in Targeted Program Performance Improvement"
Workshop on Performance Optimization for High-Level Languages and Libraries in the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), March 26th - 30th, 2007, Long Beach, California USA
Haiping Wu, Eunjung Park, Mihailo Kaplarevic, Yingping Zhang, Murat Bolat, Xiaoming Li and Guang Gao
Also available in pdf format

"Optimizing Fast Fourier Transform on a Multi-core Architecture"
Workshop on Performance Optimization for High-Level Languages and Libraries in the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), March 26th - 30th, 2007, Long Beach, California USA
Long Chen and Ziang Hu
Also available in pdf format

"A User-Friendly Methodology for Automatic Exploration of Compiler Options"
The International Conference on Programming Languages and Compilers (PLC06). Las Vegas, Nevada. June 26-29, 2006
Haiping Wu, Long Chen, Joseph Manzano, and Guang Gao
Also available in pdf format

"A User-Friendly Methodology for Automatic Exploration of Compiler Options: A Case Study on the Intel XScale Microarchitecture"
The International Conference on Programming Languages and Compilers (PLC06). Las Vegas, Nevada. June 26-29, 2006
Haiping Wu, Eunjung Park, Long Chen, Juan del Cuvillo, and Guang Gao
Also available in pdf format

"Optimization of Dense Matrix Multiplication on IBM Cyclops-64: Challenges and Experiences"
The 12th International European Conference on Parallel Processing (Euro-Par2006), Dresden, Germany, August 29 - September 1, 2006
Ziang Hu, Juan del Cuvillo, Weirong Zhu, and Guang R. Gao
Also available in pdf format

"Performance Characteristics of OpenMP Language Constructs on a Many-core-on-a-chip Architecture"
The 2nd International Workshop on OpenMP (IWOMP2006), Remis, France, June 12-25 2006
Weirong Zhu, Juan del Cuvillo, and Guang R. Gao
Also available in pdf format

"Multi-Dimensional Kernel Generation for Loop Nest Software Pipelining"
The 12th International European Conference on Parallel Processing (Euro-Par2006), Dresden, Germany, August 29 - September 1, 2006
Alban Douillet, Hongbo Rong, and Guang R. Gao
Also available in pdf format

"Towards a Software Infrastructure for the Cyclops-64 Cellular Architecture"
The 20th International Symposium on High Performance Computing Systems and Applications (HPCS'06), May 14 - 17, 2006
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao
Also available in pdf format

"Landing OpenMP on Cyclops-64: An Efficient Mapping of OpenMP to a many-core System-on-a-chip"
The 3rd ACM International Conference on Computing Frontiers, Ischia, Italy, May 2-5, 2006
Juan del Cuvillo, Weirong Zhu, Guang R. Gao
Also available in pdf format

"A Study of the On-Chip Interconnection Network for the IBM Cyclops-64 Multi-Core Architecture"
In the Proceedings of 20th International Parallel and Distributed Processing Symposium (IPDPS2006), Rhodes Island, Greece, April 25 - 29, 2006
Ying M. P. Zhang, Taikyeong Jeong, Fei Chen, Haiping Wu, Ronny Nitzsche, and Guang R. Gao
Also available in pdf format

"Hierarchical Multithreading: Programming Model and System Software"
Workshop on NSF Next Generation Software Program (NSFNGS'06), in conjunction with 20th International Parallel and Distributed Processing Symposium (IPDPS2006), Rhodes Island, Greece, April 25 - 29, 2006
Guang R. Gao, Thomas Sterling, Rick Stevens, Mark Hereld, and Weirong Zhu

"Performance Modelling and Optimization of Memory Access on Cellular Computer Architecture Cyclops-64"
Network and Parallel Computing, Beijing, China, November 30 - December 3, 2005
Yanwei Niu, Ziang Hu, Kenneth Barner, Guang R. Gao
Also available in pdf format

"Register Pressure in Software-Pipelined Loop Nests: Fast Computation and Impact on Architecture Design"
The 18th International Workshop on Languages and Compilers for Parallel Computing (LCPC'05), Hawthorne, New York, USA, October 20-22, 2005
Alban Douillet and Guang R. Gao
Also available in pdf format

"Identifying Multiply-Add Operations in Kylin Compiler"
In the proceedings of the 2005 International Conference on Embedded Systems and Applications(ESA'05),Monte Carlo Resort, Las Vegas, Nevada, USA, June 27-30, 2005
Haiping Wu, Ziang Hu, Joseph Manzano Yingping Zhang and Guang R. Gao

"Register Allocation for Software Pipelined Multi-dimensional Loops"
Programming Language Design and Implementation (PLDI), Chicago, Illinois, June 11 - 15, 2005.
Hongbo Rong, Alban Douillet, and Guang R. Gao
Also available in pdf format

"FAST: A Functionally Accurate Simulation Toolset for the Cyclops-64 Cellular Architecture"
Workshop on Modeling, Benchmarking and Simulation (MoBS), held in conjunction with the 32nd Annual International Symposium on Computer Architecture (ISCA'05), Madison, Wisconsin, June 4, 2005.
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao
Also available in pdf format

"P3I: The Delaware Programmability, Productivity and Proficiency Inquiry"
In Proceedings of the Second International Workshop On Software Engineering for High Performance Computing System Applications (SE-HPCS '05), St. Louis, Missouri, May 15, 2005
Joseph B. Manzano, Yuan Zhang and Guang R. Gao

"Atomic Section: Concept and Implementation"
Mid-Atlantic Student Workshop on Programming Languages and Systems (MASPLAS '05), Newark, Delaware, April 30, 2005.
Yuan Zhang, Joseph B. Manzano and Guang R. Gao

"TiNy Threads: a Thread Virtual Machine for the Cyclops-64 Cellular Architecture"
Fifth Workshop on Massively Parallel Processing (WMPP), held in conjunction with the 19th International Parallel and Distributed Processing System, Denver, Colorado, April 3 - 8, 2005
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao
Also available in pdf format

"Performance Portability on EARTH: A Case Study across Several Parallel Architectures"
The 4th International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS'05), conjuncted with IPDPS 2005, April 4 - 8, 2005, Denver, Colorado, USA
Weirong Zhu, Yanwei Niu, and Guang Gao

"Sequential Consistency Revisited: The Sufficient Conditions and Method to Reason Consistency Model of a Multiprocessor-on-a chip Architecture"
The IASTED International Conference on Parallel and Distributed Computing and Networks (PDCN2005), February 15 - 17, 2005, Innsbruck, Austria
Yuan Zhang, Weirong Zhu, Fei Chen, Ziang Hu, and Guang R. Gao

"If-Conversion in SSA Form"
Euro-Par 2004, 31th August - 3rd September, 2004, Pisa, Italy
Arthur Stoutchinin, and Guang R. Gao

"Single-Dimension Software Pipelining for Multi-Dimensional Loops"
2004 International Symposium on Code Generation and Optimization (CGO 2004), March 2004, San Jose, California
Hongbo Rong, Zhizhong Tang, R. Govindarajan, Alban Douillet and Guang Gao
Also available in pdf format

"Code Generation for Single-Dimension Software Pipelining of Multi-Dimensional Loops"
2004 International Symposium on Code Generation and Optimization (CGO 2004), March 2004, San Jose, California
Hongbo Rong, Alban Douillet, R. Govindarajan and Guang Gao
Also available in pdf format

"DIMES: An Iterative Emulation Platform for Multiprocessor-System-on-Chip Designs"
IEEE International Conference on Field-Programmable Technology (FPT'03), December 15-17, 2003, Tokyo, Japan
Hirofumi Sakane, Levent Yakay, Vishal Karna, Clement Leung and Guang R. Gao

"Code Size Oriented Memory Allocation for Temporary Variables"
Fifth Workshop on Media and Streaming Processors (MSP-5/MICRO-36), December 1, 2003, San Diego, California
Ziang Hu, Yan Xie and Guang R. Gao

"A Cluster-Based Solution for High Performance Hmmpfam Using EARTH Execution Model"
Fifth IEEE International Conference on Cluster Computing (CLUSTER2003), December, 2003, Hong Kong, P.R. China
Weirong Zhu, Yanwei Niu, Jizhu Lu, Chuan Shen and Guang R. Gao

"Code Size Reduction with Global Code Motion"
Workshop on Compilers and Tools for Constrained Embedded Systems (CTCES/CASES) 2003, October 29, 2003, San Jose, California
Ziang Hu, Yuan Zhang, Hongbo Yang and Guang R. Gao

"Performance Study of a Whole Genome Comparison Tool on a Hyper-Threading Multiprocessor"
Fifth International Symposium on High Performance Computing, October 20-22, 2003, Tokyo, Japan
Juan del Cuvillo, Xinmin Tian, Guang Gao and Millind Girkar

"CARE: Overview of an Adaptive Multithreaded Architecture"
Fifth International Symposium on High Performance Computing, October 20-22, 2003, Tokyo, Japan
Andres, Marquez and Guang R. Gao

"Compiler-Assisted Cache Replacement: Problem Formulation and Performance Evaluation"
16th International Workshop on Languages and Compilers for Parallel Computing(LCPC'03), October, 2003, College Station, Texas
Hongbo Yang, R. Govindarajan, Guang R. Gao and Ziang Hu

"An Executable Analytical Performance Evaluation Approach for Early Performance Prediction"
International Parallel and Distributed Processing Symposium (IPDPS'03) April 22 - 26, 2003 Nice, France
Adeline Jacquet, Vincent Janot,Clement Leung, Guang R. Gao, R. Govindarajan, Thomas L. Sterling

"Programming Models and System Software for Future High-End Computing Systems: Work-in-Progress"
International Parallel and Distributed Processing Symposium (IPDPS'03) April 22 - 26, 2003 Nice, France
Guang R. Gao, Kevin B. Theobald, R. Govindarajan, Clement Leung, Ziang Hu, Haiping Wu, Jizhu Lu, Juan del Cuvillo, Adeline Jacquet, Vincent Janot and Thomas L. Sterling

"On Achieving Balanced Power Consumption in Software Pipelined Loops"
2002 International Conference on Compilers, Architecture and Synthesis for Embedded Systems(CASES), Oct 8-11, 2002, Grenoble, France
Hongbo Yang, Guang R. Gao, Clement Leung, R. Govindarajan, Haiping Wu
Available as gzipped Postscript.

"Exploiting Schedule Slacks for Rate-Optimal Power-Minimum Software Pipelining"
3rd Workshop on Compilers and Operating Systems for Low Power(COLP), held in conjunction with The Eleventh International Conference on Parallel Architecture and Compilation Techniques(PACT), Sept 22 - 25, 2002, Charlottesville, Virginia
Hongbo Yang, R. Govindarajan, Guang R. Gao, George Cai, Ziang Hu,
Available as gzipped Postscript.

"Power-Performance Trade-offs for Energy-Efficient Architectures: A Quantitative Study"
20th International Conference on Computer Design(ICCD) 2002, September 16-18, 2002, Freiburg, Germany
Hongbo Yang, R. Govindarajan, Guang R. Gao, Kevin B. Theobald
Available gzipped Postscript.

"Whole Genome Alignment using a Multithreaded Parallel Implementation"
Symposium on Computer Architecture and High Performance Computing, September 10-12, 2001, Pirenopolis, Brazil
Wellington S. Martins, Juan del Cuvillo, Wenwu Cui and Guang R. Gao

"Power and Energy Impact by Loop Transformations"
Workshop on Compilers and Operating Systems for Low Power 2001, in conjunction with Parallel Architecture and Compilation Techniques 2001, Sept 8, 2001, Barcelona, Spain
Hongbo Yang, Guang R. Gao, Andres Marquez, George Cai, Ziang Hu
Available as gzipped Postscript.

"A Multi-Threaded Runtime System for a Multi-Processor/Multi-Node Cluster"
15th Annual International Symposium on High Performance Computing Systems and Applications, June 18-20, 2001, Windsor, ON, Canada
Christopher J. Morrone, Jos‰ Nelson Amaral, Guy Tremblay, and Guang R. Gao

"Minimum Register Instruction Sequence Problem: Revisiting Optimal Code Generation for DAGs"
International Parallel and DistributedProcessing Symposium 2001, April 24 - 28, 2001, San Francisco, CA
R. Govindarajan, Hongbo Yang, C. Zhang, J.N. Amaral, G.R. Gao
Available as and gzipped Postscript.

"Multithreaded Algorithms for Pricing a Class of Complex Options"
International Parallel and Distributed Processing Symposium 2001 - Apr 24 - 28, 2001, San Francisco, CA
Ruppa K. Thulasiram, Lubomir Litov, Hassan Nojumi, Christopher T. Downing, Guang R. Gao
Available as gzipped Postscript.

"Speculative Prefetching of Induction Pointers"
International Conference on Compiler Construction 2001, Apr 2 - 6, 2001. Italy.
Artour Stoutchinin, Jose Nelson Amaral, Guang R. Gao, Jim Dehnert, Suneel Jain, Alban Douillet
Available as gzipped Postscript.

"Computer Detection of Single Nulcleotide Polymorphisms (SNPs) in Maize ESTs"
Plant & Animal Genome IX Conference - January 13 - 17, 2001
F. Useche, M. Morgante, M.Hanafey, Scott Tingey, Guang R. Gao and Antoni Rafalski

"A Multithreaded Parallel Implementation of a Dynamic Programming Algorithm for Sequence Comparison"
Pacific Symposium on Biocomputing 2001, Pages 311-322, Big Island of Hawaii, January 3 - 7, 2001
W.S. Martins, J.B. del Cuvillo, F.J. Useche, K.B. Theobald, G.R. Gao

"Landing CG on EARTH: A Case Study of Fine-Grained Multithreading on an Evolutionary Path"
Proceedings of SC2000: High Performance Networking and Computing, Dallas, Texas, November 4--10, 2000
Kevin B. Theobald, Gagan Agrawal, Rishi Kumar, Gerd Heber, Guang R.Gao, Paul Stodghill and Keshav Pingali

"Developing a Communication Intensive Application on the EARTH Multithreaded Architecture"
Proceedings of Euro-Par 2000, unchen, Germany, August 28--September 1, 2000
Kevin B. Theobald, Rishi Kumar, Gagan Agrawal Gerd Heber, Ruppa K. Thulasiram and Guang R. Gao

"Parallel FEM Simulation of Crack Propagation --Challenges, Status, and Perspectives"
International Parallel and Distributed Processing Symposium (IPDPS'00),Pages 443-449 Cancun, Mexico, May 2000
Bruce Carter, Chuin-Shan Chen, L. Paul Chew, Nikos Chrisochoides, Guang R. Gao, Gerd Heber, Antony R. Ingraffea, Roland Krause, Chris Myers, Demian Nave, Keshav Pingali, Paul Stodghill, Stephen Vavasis, Paul A. Wawrzynek

"Caching Single-Assignment Structures to Build a Robust Fine-Grain Multi-Threading System"
International Parallel and Distributed Processing Symposium (IPDPS'00),Pages 589-594, Cancun, Mexico, May 01-05, 2000
Wen-Yen Lin, Jean-Luc Gaudiot, Jos‰ Nelson Amaral and Guang R. Gao

"Performance Analysis of the I-Structure Software Cache on Multi-Threading Systems"
19th IEEE International Performance, Computing and Communication Conference-IPCCC2000, Phoenix, Arizona, USA, Feb. 20-22, 2000
Wen-Yen Lin, Jean-Luc Gaudiot, Jos‰ Nelson Amaral and Guang R. Gao

"A Comparative Performance Study of Fine-Grain Multi-threading on Distributed Memory Machines"
19th IEEE International Performance, Computing and Communication Conference-IPCCC2000, Phoenix, Arizona, USA, Feb. 20-22, 2000
Prasad Kakulavarapu, Christopher J. Morrone, Kevin B. Theobald, Jos‰ Nelson Amaral and Guang R. Gao

"Coping With Very High Latencies in Petaflops Computer Systems"
ISHPC '99, May 25, 1999
Sean Ryan, Jose N. Amaral, Guang Gao, Zachary Ruiz, Andres Marquez, Kevin Theobald.

"A Multithreading Parallel Computational Approach for Valuing Derivatives"
First WAFA Finance Research Conference, Fairfax, Virginia, April 30, 1999
R.K. Thulasiram and Guang R. Gao

"Load Adaptive Algorithms and Implementations for the 2D Discrete Wavelet Transform on Fine-Grain Multithreaded Architectures"
SPDP '99, San Juan, Puerto Rico, April, 1999
Ashfaq A. Khokhar, Gerd Heber, Parimala Thulasiraman, Guang R. Gao
Available as gzipped Postscript.

"A New Approach to Parallel Dynamic Partitioning for Adaptive Unstructured Meshes"
SPDP '99, San Juan, Puerto Rico, April 1999
Gerd Heber, Rupak Biswas, Guang R. Gao.
Available as gzipped Postscript.

"Self-Avoiding Walks over Adaptive Unstructured Grids"
Irregular '99, San Juan, Puerto Rico, April, 1999
Gerd Heber, Rupak Biswas, Guang R. Gao.
Available as gzipped Postscript.

"Efficient State-Diagram Construction Methods for Software Pipelining"
CC '99, Amsterdam, The Netherlands, March 20-28, 1999
Chihong Zhang, R. Govindarajan, Sean Ryan, Guang R. Gao.
Available as gzipped Postscript.

"HTMT-C: Proposing A Programming Language For A Petaflop Machine"
Mid-Atlantic Student Workshop on Programming Languages and Systems, Pages 53-68, Maryland, March 1999
Sean Ryan, Jose Nelson. Amaral, Zachary Ruiz and Guang Gao

"Superconducting Processors for HTMT: Issues and Challenges"
FRONTIERS '99, pp 260-267, Annapolis, Maryland, February 21-25 1999
Kevin B. Theobald, Guang R. Gao.
Available as gzipped Postscript.

"Performance Prediction for the HTMT: A Programming Example"
TFP3 '99, Annapolis, Maryland, February 22, 1999
Jose Nelson Amaral, Guang R. Gao, Phillip Merkey, Thomas Sterling, Zachary Ruiz, Sean Ryan.

"Design and Evaluation of Dynamic Load Balancing Schemes under a Fine-grain Multithreaded Execution Model"
Multithreaded Execution Architecture and Compilation Workshop, Orlando, Florida, Jan 9th, 1999
Haiying Cai, Olivier Maquelin, Prasad Kakulavarapu, and Guang R. Gao.

"An Implementation of a Hopfield Network Kernel on EARTH"
Brazilian Symposium on Computer Architecture and High Performance Processing, Pages 223-232, B”zios, RJ, Brazil, September 1998
Jose N. Amaral, Guang Gao, Xinan Tang
Available as gzipped Postscript.

"Using Multithreading for the Automatic Load Balancing of Adaptive Finite Element Meshes"
Irregular '98, Berkeley, California, August, 1998
Gerd Heber, Rupak Biswas, Parimala Thulasiraman, Guang R. Gao
Available as gzipped Postscript.

"Elastic History Buffer: A Low-Cost Method to Improve Branch Prediction Accuracy"
ICCD '97, pp. 82-87, Austin, Texas, October 12-15, 1997
Guang R. Gao, Maria-Dana Tarlescu, Kevin B. Theobald.
Available as gzipped Postscript.

"A Superstrand Architecture and its Compilation"
Proceedings of the 1997 IEEE International Conference on Author Guidelines (ICAG'97), pp. 10--12, Somewhere, Someplace, April 1, 1997
Andres, Marquez, Kevin B. Theobald, Xinan Tang, and Guang R. Gao

"Thread Partitioning and Scheduling Based on Cost Model"
SPAA '97, pp. 272-281, Newport, Rhode Island, June 22-25, 1997
Guang R. Gao, Xinan Tang, Jian Wang, Kevin B. Theobald.
Available as gzipped Postscript.


 

 

Journal Papers

Single-Dimension Software Pipelining for Multidimensional Loops
ACM Transactions on Architecture and Code Optimization (TACO), Volume 4, Issue 1, March 2007, Article No. 7.
Hongbo Rong, Zhizhong Tang, R. Govindarajan, Alban Douillet, Guang R. Gao

Performance Portability on EARTH: A Case Study across Several Parallel Architectures
Cluster Computing, Volume 10, Number 2, June, 2007, page 115-126.
Weirong Zhu, Yanwei Niu, and Guang R. Gao

Madd Operation Aware Redundancy Elimination
International Journal of Software Engineering and Knowledge Engineering, Vol. 15, No. 2, 2005, pp357-362
Haiping Wu, Ziang Hu, Joseph Manzano and Guang. R. Gao.

Improving Power Efficiency with Compiler-Assisted Cache Replacement
Journal of Embedded Computing accepted, 2005
Hongbo~Yang, R. Govindarajan, Guang R. Gao, Ziang Hu

A Cluster-Based Solution for High Performance Hmmpfam Using EARTH Execution Model
International Journal of High Performance Computing and Networking, Vol 2, Issue 2/3/4, 2004
Weirong Zhu, Yanwei Niu, Jizhu Lu, Chuan Shen and Guang R. Gao,

An Improved Hidden Markov Model for Transmembrane Protein Topology Prediction and Its Applications to Complete Genomes
Bioinformatics, Volume 21, Number 9, pp. 1853-158, 2005
Robel Kahsay, Li Liao , Guang Gao

Quasi-Consensus Based COMParison of Profile Hidden Markov Models for Protein Sequences
Bioinformatics, Volume 21, Number 10, pp. 2287-2293, 2005
Robel Kahsay, Guoli Wang, Guang Gao, Li Liao and Roland Dunbrack.

Efficient Multithreaded Algorithms for the Fast Fourier Transform
Parallel and Distributed Computing Practices, Vol. 5, No. 2, Pages: 177-191, 2004
Parimala Thulasiraman, Kevin B. Theobald, Ashfaq A. Khokhar, and Guang R. Gao

A Fine-Grain Load Adaptive Algorithm of the 2D Discrete Wavelet Transform for Multithreaded Architectures
Journal of Parallel and Distributed Computing (JPDC), Vol.64, No.1, Pages: 68-78, January 2004
Parimala Thulasiraman, Ashfaq A. Khokhar, Gerd Heber, Guang R. Gao

Evaluation and Choice of Various Branch Predictors for Low-Power Embedded Processor
Journal of Computer Science and Technology, Vol. 18, No. 6, Pages: 833-838, November, 2003
Dong Rui Fan, Hongbo Yang, Gaung R. Gao, and Rong Cai Zhao

Minimum Register Instruction Sequencing to Reduce Register Spills in Out-of-Order Issue Superscalar Architectures
IEEE Transactions on Computers, Vol. 52, No. 1, Pages: 4-20, January 2003
Ramaswamy Govindarajan, Hongbo Yang, José N Amaral, Chihong Zhang, and Guang R. Gao

Implementation of the EARTH Programming Model on SMP Clusters: a Multi-Threaded Language and Runtime System
Concurrency and Computation: Practice and Experience, Vol. 15, No. 9, Pages: 821-844, August 2003
Guy Tremblay, Christopher J. Morrone, José N. Amaral, and Guang R.Gao

Minimizing Buffer Requirements in Rate-Optimal Schedules in Regular Dataflow Networks
Journal of VLSI Signal Processing, Vol. 31, No. 3, Pages: 207-229, Jul 2002
Ramaswamy Govindarajan and Guang R. Gao

A Theory for Co-Scheduling Hardware and Software Pipelines in ASIPs and Embedded Processors
Design Automation for Embedded Systems, Vol. 6, No. 3, Pages: 243-275, March 2002
Ramaswamy Govindarajan, Erik R. Altman, and Guang R. Gao

CASA: A Server for The Critical Assessment of Sequence Alignment Accuracy
Bioinformatics, Vol. 18, No. 3, Pages: 496-497, March 2002
Robel Y. Kahsay, Nataraj Dongre, Guang R. Gao, Guoli Wang, and Roland L. Dunbrack Jr.

TROLL--Tandem Repeat Occurrence Locator
Bioinformatics, Vol. 18, No. 4, Pages: 634-636, April 2002
Adalberto T. Castelo, Wellington S. Martins, and Guang R. Gao

Exploiting Locality in single Assignment Data Structures Updated through Split Phase Transactions
Cluster Computing, Special issue on Internet Scalability: Advances in Parallel, Distributed and Mobile Systems, Vol. 4, No. 4, Pages: 281-293, October 2001
José N Amaral, Wen-Yen Lin, Jean-Luc Gaudiot, and Guang R. Gao

Dynamic Load Balancers for a Multithreaded Multiprocessor System
Parallel Processing Letters, Vol. 11, No. 1, Pages: 169-184, March 2001
Prasad Kakulavarapu, Olivier Maquelin, José N Amaral, and Guang R. Gao

A New Memory Model and Cache Consistency Protocol
IEEE Transactions on Computers, Vol. 49, No. 8, Pages: 798-813, August 2000
Guang R. Gao and Vivek Sarkar, Location Consistency

Automatically Partitioning Threads for Multithreaded Architectures
Special Issues on Compilation and Architectural Support for Parallel Applications, Journal of Parallel and Distributed Computing, Vol. 58, No. 2, Pages: 159-189, August 1999
Xinan Tang and Guang R. Gao

Advances in the Dataflow Computational Model
Parallel Computing , Vol. 25, No.13 - 14, Pages: 1907 . 1927, 1999
Walid A. Najjar , Edward A Lee, and Guang R Gao

A New Framework for Elimination Based Data Flow Analysis Using DJ Graphs
ACM Transaction on Programming Languages and Systems, Vol. 20, No. 2, Pages 388-435, March 1998
Vugranam C. Sreedhar, Guang R. Gao, and Yong-Fong Lee

Optimal Modulo Scheduling Through Enumeration
International Journal on Parallel Programming, Vol. 26, No.2, Pages: 313-344, 1998
Erik R. Altman and Guang R. Gao

A Unified Framework for Instruction Scheduling and Mapping for Function Units with Structural Hazards
Journal of Parallel and Distributed Computing, Vol. 49, No. 2, Pages: 259-293, 1998
Erik R. Altman, Ramaswamy Govindarajan, and Guang R. Gao

Incremental Computation of Dominator Trees
ACM Transactions on Programming Languages and Systems, Vol. 19, No. 2, Pages: 239-252, March 1997
Vugranam C. Sreedhar, Guang R. Gao, and Yong-fong Lee

A Quadratic Time Algorithm for Computing Multiple Node Immediate Dominators
Journal of Programming Languages, 1996
Vugranam C. Sreedhar, Guang R. Gao, and Yongfong Lee

A Framework for Resource-constrained Rate-optimal Software Pipelining
IEEE Transactions on Parallel and Distributed Systems, Vol. 7, No. 11, Pages: 1133-1149, November 1996
Ramaswamy Govindarajan, Erik R. Altman, and Guang R. Gao

A Study of the EARTH-MANNA Multithreaded System
International Journal of Parallel Programming, Vol. 24, No. 4, Page 319-347, August 1996
Herbert H. J. Hum, Olivier Maquelin, Kevin B. Theobald, Xinmin Tian, Guang R. Gao, and Laurie J. Hendren

Identifying Loops Using DJ Graphs
ACM Transactions on Programming Languages and Systems (TOPLAS), Vol. 18, No. 6, Pages: 649 . 658, November 1996
Vugranam Sreedhar, Guang R. Gao, and Yongfong Lee

A Linear Time Algorithm for Placing OE-nodes
Journal of Programming Languages, 1995. Accepted
Vugranam C. Sreedhar and Guang R. Gao

Automatic Data and Computation Decomposition for Distributed Memory Machines
Parallel Processing Letters, Vol. 5, No. 4, Pages: 539-550, April 1995
Qi Ning, Vincent V. Dongen, and Guang R. Gao

Computing phi-nodes in Linear Time Using DJ Graphs
Journal of Programming Languages, Vol. 3, Pages: 191-213, April 1995
Vugranam C. Sreedhar and Guang R. Gao

ABC++: Concurrency by Inheritance in C++
IBM Systems Journal, Vol. 34, No. 1, Pages: 120-137, 1995
Eshrat Arjomandi, William O'Farrell, Ivan Kalas,Gita Koblents, Frank Ch. Eigler, and Guang. R. Gao

Rate-optimal Schedule for Multi-rate DSP Computations
Journal of VLSI Signal Processing, Vol. 9, No.3, Pages: 211-232, April 1995
Ramaswamy Govindarajan and Guang R. Gao

An Efficient Hybrid Dataflow Architecture Model
Journal of Parallel and Distributed Computing, Vol. 19, No. 4, Pages: 293-307, December 1993
Guang. R. Gao

A Register Allocation Framework Based on Hierarchical Cyclic Interval Graphs
The Journal of Programming Languages, Vol. 1, No. 3, Pages: 155-185, 1993
Laurie J. Hendren, Guang R. Gao, Erik R. Altman, and Chandrika Mukerji

Optimal Loop Storage Allocation for Argument-fetching Dataflow Machines
International Journal of Parallel Programming, Vol. 21, No. 6, Pages: 421-448, December 1992
Qi Ning and Guang R. Gao
A High-speed Memory Organization for Hybrid Dataflow/von Neumann Computing
Future Generation Computer Systems, Vol. 8, Pages: 287-301, 1992
Herbert H. J. Hum, and Guang. R. Gao

Toward Efficient Fine-grain Software Pipelining and the Limited Balancing Techniques
International Journal of Mini and Microcomputers, Vol. 13, No. 2, Pages: 57-68, 1991
Guang. R. Gao, Herbert H. J. Hum, and Yue-Bong Wong

Exploiting Fine-grain Parallelism on Dataflow Architectures
Parallel Computing, Vol. 13, No. 3, Pages: 309-320, March 1990
Guang R. Gao


 

Technical Memos

CAPSL Technical Memo 81:
Performance Tuning of the Fast Fourier Transform on a Multicore Architecture

Liping Xue, Long Chen, Ziang Hu, and Guang R Gao
Febraury 8th, 2008


CAPSL Technical Memo 80:
Order Free Consistency: Towarding a Fully Asynchronous Memory Model

Chen Chen, Joseph B Manzano, Wenguang Chen, and Guang R Gao
November, 2007
Available upon request

CAPSL Technical Memo 79:
Concurrency Analysis for Shared Memory Programs with Textually Unaligned Barriers

Yuan Zhang, Evelyn Duesterwald, and Guang R Gao
November, 2007


CAPSL Technical Memo 78:
Implementation of the Smith-Waterman Algorithm on A Reconfigurable Supercomputing Platform

Peiheng Zhang, Guangming Tan and Guang R. Gao
April 16th, 2007


CAPSL Technical Memo 77:
A Study of Parallel Betweenness Centrality Algorithm on a Many-core architecture

Guangming Tan and Guang R. Gao
June 27th, 2007
Also available in pdf format

CAPSL Technical Memo 76:
FAME: Financial Application with Many-core-on-a-chip architecturE

Weirong Zhu, Parimala Thulasiraman, Ruppa K. Thulasiram and Guang R. Gao
February 17th, 2006
Also available in pdf format

CAPSL Technical Memo 75:
Optimizing the LU Benchmark for the Cyclops-64 Architecture

Ioannis E. Venetis and Guang R. Gao
February, 2007
Also available in pdf format

CAPSL Technical Memo 74:
Exploring a Multithreaded Methodology to Implement a Network Communication Protocol on the IBM Cyclops-64 Multithreaded Architecture

Ge Gan, Ziang Hu, Juan del Cuvillo, and Guang R. Gao
January, 2007
Also available in pdf format

CAPSL Technical Memo 73:
A Parallel Dynamic Porgramming Algorithm on a Multi-core Architecture

Guangming Tant, and Guang R. Gao
February, 2007
Also available in pdf format

CAPSL Technical Memo 72:
Automatic Program Segment Similarity Detection in Targeted Program Performance Improvement

Haiping Wu, Eunjung Park, Mihailo Kaplarevic, Yingping Zhang, Murat Bolat and Guang R. Gao
December 30, 2006
Also available in pdf format

CAPSL Technical Memo 71:
An Automatic Methodology for Program Segment-based Compiler Optimization Search

Haiping Wu, Eunjung Park, Murat Bolat, Mihailo Kaplarevic, Yingping Zhang, Xiaoming Li and Guang R. Gao
November 14, 2006
Also available in pdf format

CAPSL Technical Memo 70:
Handling Massive Parallelism Efficiently: Introducing Batches of Threads

Ioannis E. Venetis, Theodore S. Papatheodorou, and Guang R. Gao
October 18, 2006
Also available in pdf format

CAPSL Technical Memo 69:
Software Pipelining On Multi-core Chip Architectures: A case study on IBM Cyclops-64 Chip Architure

Alban Douillet, Junmin Lin, and Guang R. Gao
February 14, 2006
Available upon request

CAPSL Technical Memo 68:
Server I/O Acceleration Using an Embedded Multi-core Architecture

Lurng-Kuo Liu, Fei Chen, Christos J. Georgiou and Guang R. Gao
May 12, 2006
Available upon request

CAPSL Technical Memo 67 Revised:
Synchronization State Buffer: Supporting Efficient Fine-Grain Synchronization on Many-Core Architectures

Weirong Zhu, Vugranam C. Sreedhar, Ziang Hu, and Guang R. Gao
Novemeber 20, 2006
Available upon request

CAPSL Technical Memo 67:
Efficient Fine-Grain Synchronization on a Multi-Core Chip Architecture: A Fresh Look

Weirong Zhu, Ziang Hu, and Guang R. Gao
July 17, 2006

CAPSL Technical Memo 66:
An Efficient Communication Infrastructure for the IBM Cyclops-64 Computer System

Ge Gan, Ziang Hu, Juan del Cuvillo, Guang R. Gao
June 12, 2006
Also available in pdf format

CAPSL Technical Memo 65:
Optimized Lock Assignment and Allocation for Productivity: A Method for Exploiting Concurrency among Critical Sections

Yuan Zhang, Vugranam C. Sreedhar, Weirong Zhu, Vivek Sarkar, Guang R. Gao
May 10th, 2006
Also available in pdf format

CAPSL Technical Memo 64:
Multidimensional Kernel Generation for Loop Nest Software Pipelining

Alban Douillet, Hongbo Rong, and Guang R. Gao
Febraury 13th, 2006
Also available in pdf format

CAPSL Technical Memo 63:
A New Framework for Analysis and Optimization of Shared Memory Parallel Programs"

Vugranam C. Sreedhar, Yuan Zhang, and Guang R. Gao
July 18th, 2005

CAPSL Technical Memo 62:
" FAST: A Functionally Accurate Simulation Toolset for the Cyclops-64 Cellular Architecture"
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang Gao
June 17th, 2005
Also available in pdf format

CAPSL Technical Memo 61:
"P3I: Delaware's Programmability, Productivity and Proficiency Inquiry"
Joseph B. Manzano, Yuan Zhang, and Guang R. Gao
June 10th, 2005

CAPSL Technical Memo 60:
"Performance Analysis of Interconnection Network of Cyclops-64 Chip Architecture"
Yingping Zhang, Taikyeong Jeong, Fei Chen, Ronny Nitzsche, Guang R. Gao
June 1st, 2005

CAPSL Technical Memo 59:
"Concurrency Analysis and Its Applications"
Yuan Zhang, and Guang Gao
May 28th, 2005

CAPSL Technical Memo 58:
"Register Pressure in Software Pipelined Loop Nests: Fast Computation and Impact on Architecture Design"
Alban Douillet, Hongbo Rong, and Guang Gao
May 3rd, 2005

CAPSL Technical Memo 57:
"Parallel Reconstruction for Parallel Imaging SPACERIP on Cellular Architecture"
Yuanwei Niu, Ziang Hu, and Guang Gao
June 15, 2004

CAPSL Technical Memo 56:
"Quasi consensus based comparison of profile hidden Markov models for protein sequences"
Robel Y. Kahsay, Guoli Wang, Li Liao, Roland Dunbrack and Guang Gao
May 28, 2004

CAPSL Technical Memo 55:
"Toward a Software Infrastructure for the Cyclops-64 Cellular Architecture"
Juan B. del Cuvillo, Ziang Hu, Weirong Zhu, Fei Chen and Guang R. Gao
April 26, 2004
Also available in pdf format

CAPSL Technical Memo 54:
"Speeding up CG on Cluster with Two Dimensional Blocking Method and EARTH Runtime Support"
Fei Chen, Kevin B. Theobald and Guang R. Gao
April 23, 2004

CAPSL Technical Memo 53:
"Lamport Order Revisit: A Study on How to Efficiently Achieve Sequential Consistency on a Modern Multiprocessor-on-a-Chip Architecture"
Yuan Zhang, Weirong Zhu, Fei Chen, Ziang Hu and Guang R. Gao
March 01, 2004
Also available in pdf format

CAPSL Technical Memo 52:
"Analyzable Atomic Sections: Integrating Fine-Grained Synchronization and Weak Consistency Models for Scalable Parallelism"
Vivek Sarkar and Guang R. Gao
February 09, 2004
Also available in pdf format

CAPSL Technical Memo 51:
"Code Generation for Single-Dimension Software Pipelining of Multi-Dimensional Loops"
Hongbo Rong, Alban Douillet, R.Govindarajan and Guang R. Gao
September 26, 2003
Also available in pdf format

CAPSL Technical Memo 49:
"Single-Dimension Software Pipelining for Multi-Dimensional Loops"
Hongbo Rong, Zhizhong Tang, R.Govindarajan, Alban Douillet and Guang R. Gao
September 26, 2003
Also available in pdf format

CAPSL Technical Memo 48:
"Programming Method and software Infrastructure for Cellular Architecture"
Guang R. Gao, Juan del Cuvillo, Ziang Hu, Robert Klosiwicz, Clement Leung, Jason McGuiness, Hirofumi Sakane, Yingping Zhang
September 16, 2003

CAPSL Technical Memo 47:
"Compiler-Assisted Cache Replacement: Problem Formulation and Performance Evaluation"
Hongbo Yang, R. Govidarajan, Guang R. Gao, Ziang Hu
September 9, 2003

CAPSL Technical Memo 44:
"Algorithms, Applications, and Environments for Emerging Petascale Architectures"
R. Govindarajan, and H. Tufo and S. Thomas and R. Loft and Guang R. Gao and J. Moreira and J.Castanos
March 6, 2003
Also available in pdf format

CAPSL Technical Memo 43:
"Executable Performance Model and Evaluation of High Performance Architectures with Percolation"
Adeline Jacquet, Vincent Janot, R. Govindarajan, Clement Leung, Guang Gao and Thomas Sterling
November 21, 2002
Also available in pdf format

CAPSL Technical Memo 42:
"A Quantitative Study on Performance-Power Impact of Dual-Speed Pipeline Architectures"
Hongbo Yang, R.Govindarajan, Guang R. Gao, Kevin B. Theobald
June 13, 2002
Also available in pdf format

CAPSL Technical Memo 41:
"Maximizing Pipelined Functional Units Usage for Minimum Power Software Pipelining"
Hongbo Yang, R.Govindarajan, Guang R. Gao, George Cai
September 27, 2001
Also available in pdf format

CAPSL Technical Memo 40:
"New Normalization Method and Error Analysis for Gene Expression Microarray Data"
Stanley D. Luck, Francisco Jose Useche G., Wellington S. Martins, Guang R. Gao
December 11, 2000
Also available in pdf format

CAPSL Technical Memo 39:
"Threaded-C Language Reference Manual (Release 2.0)"
Guy Tremblay, Kevin B.Theobald, Christopher J.Morrone, Mark D.Butala, Jose Nelson Amaral and Guang R. Gao
September 23, 2000

CAPSL Technical Memo 38:
"Automatic Prefetching of Induction Pointers"
Artour Stouctchinin, Jose Nelson Amaral, Guang R. Gao, Jim Dehnert, Suneel Jain, Alban Douillet
April 18, 2000

CAPSL Technical Memo 37:
"Automatic Prefetching of Induction Pointers for Software Pipelining"
Artour Stoutchinin, Jose Nelson Amaral, Guang R. Gao, Jim Dehnert, Suneel Jain
November 12, 1999

CAPSL Technical Memo 36:
"Minimum Register Instruction Sequence Problem: Revisiting Large Optimal"
R. Govindarajan, Hongbo Yang, Chihong Zhang, Jose Nelson Amaral, Guang R. Gao
November 12, 1999

CAPSL Technical Memo 35:
"A Comparative Performance Study of Fine-Grain Multi-Threading on Distributed Memory Machines"
Prasad Kakulavarapu, Christopher J. Morrone, Kevin B. Theobald, Jose Nelson Amaral, Guang R. Gao
November 11, 1999

CAPSL Technical Memo 34:
"Caching Single-Assignment Structures to Build a Robust Fine-Grain Multi-Threading System"
Wen-Yen Lin, Jose Nelson Amaral, Jean-Luc Gaudiot, Guang Gao
October 13, 1999

CAPSL Technical Memo 33:
"Definition of the EARTH Model"
Kevin B. Theobald
October 6, 1999

CAPSL Technical Memo 32:
"The Benefits of Hardware-Assisted Fine-Grain Multithreading"
Kevin B. Theobald, Guang R. Gao
July 20, 1999

CAPSL Technical Memo 31:
"HTMT Phase 2 Report"
Guang R Gao, Jose Nelson Amaral, Andres Marquez, Kevin B. Theobald, Sean Ryan, Zachary Ruiz, Thomas Geiger, Christopher J. Morrone
July 19, 1999

CAPSL Technical Memo 30:
"Design and Implementation of an Eefficient Thread Partitioning Algorithm"
Jose Nelson Amaral, Guang R. Gao, Erturk Dogan Kocalar, Patrick O'Neil, Xiang Tang
July 1, 1999

CAPSL Technical Memo 29:
"Advances in Dataflow Computational Model"
Walid A Najjar, Edward A. Lee and Guang R. Gao
April 1, 1999

CAPSL Technical Memo 28:
"Efficient State-Diagram Construction Methods for Software Pipelining"
Chihong Zhang, R. Govindarajan, Sean Ryan, Guang R. Gao
March 5, 1999

CAPSL Technical Memo 27:
"SEMi: A Simulator for EARTH, MANNA, and i860"
Kevin Theobald
March 1, 1999

CAPSL Technical Memo 26:
"An HTMT Performance Prediction Case Study: Implementing Cannon's Dense Matrix Multiply Algorithm"
Jose Nelson Amaral, Guang R. Gao, Phillip Merkey, Thomas Sterling, Zachary Ruiz, Sean Ryan
February 17, 1999

CAPSL Technical Memo 25:
"Option Pricing Problem on a Multithreaded Parallel Architecture"
Ruppa K. Thulasiram and Guang R.Gao
November 11, 1998

CAPSL Technical Memo 24:
"Design of the Runtime System for the Portable Threaded-C Language"
Prasad Kakulavarapu, Olivier Maquelin and Guang R. Gao
July 21, 1998

CAPSL Technical Memo 23:
"Automatically Partitioning Threads Based on Remote Paths"
Xinan Tang, Guang R. Gao
July 20, 1998

CAPSL Technical Memo 22:
"A Refinement of the HTMT Program Execution Model"
Guang Gao, Jose Nelson Amaral, Andres Marquez and Kevin Theobald"
July 13, 1998

CAPSL Technical Memo 21:
"Self-Avoiding Walks Over Two-Dimensional Adaptive Unstructured Grids"
Gerd Heber, Rupak Biswas and Guang R. Gao
April 20, 1998

CAPSL Technical Memo 20:
"Using Multithreading for the Automatic Load Balancing of 2-D Adaptive Finite Element Meshes"
Gerd Heber, Rupak Biswas,Parimala Thulasiraman, and Guang R. Gao
March 16, 1998

CAPSL Technical Memo 19:
"Overview of the Threaded-C Language"
Kevin B. Theobald, Jose Nelson Amaral, Gerd Herber, Oliver Maquelin, Xinan Tang, Guang R. Gao
March 16, 1998

CAPSL Technical Memo 18:
"A Superstrand Architecture"
Andres Marquez, Kevin B. Theobald, Xinan Tang, Thomas L. Sterling and Guang R. Gao
March 14, 1998

CAPSL Technical Memo 17:
"An Enhanced Co-Scheduling Method Using Reduced MS-State Diagrams"
R. Govindarajan, N.S.S. Narasimha Rao, Erik R. Altman and Guang R. Gao
February 18, 1998

CAPSL Technical Memo 16:
"Location Consistency -- A New Memory Model and Cache Consistency Protocol"
Guang R. Gao, Vivek Sarkar
February 16, 1998

CAPSL Technical Memo 15:
"Superconducting Processors for HTMT: Issues and Challenges"
Kevin B. Theobald, Guang R. Gao and Thomas L. Sterling
December 15, 1997

CAPSL Technical Memo 14:
"A Superstrand Architecture"
Andres Marquez, Kevin B. Theobald, Xinan Tang, Guang R. Gao
December 1, 1997

CAPSL Technical Memo 13:
"Partial Sampling with Reverse State Reconstruction: A New Technique for Branch Predictor Performance Estimation"
Darren E. Vengroff and Guang R. Gao

CAPSL Technical Memo 12:
V. Sreedhar's paper (no information on this)

CAPSL Technical Memo 11:
"Heap Analysis and Optimizations for Threaded Programs"
Xinan Tang, Rakesh Ghiya, Laurie J. Hendren and Guang R. Gao
November 7, 1997

CAPSL Technical Memo 10:
"A Register Pressure Sensitive Instruction Scheduler for Dynamic Issue Processors"
Raul Silvera, Jian Wang and Guang R. Gao

CAPSL Technical Memo 09:
"The HTMT Program Execution Model"
Guang R. Gao, Kevin B. Theobald, Andres Marquez, Thomas Sterling
July 18, 1997

CAPSL Technical Memo 08:
"Benefits of Efficient Multithreading on Distributed Memory for the Parallelization of Communication-Intensive Applications"
Angela C. Sodan and Guang R. Gao

CAPSL Technical Memo 07:
"An Interger Linear Programming Model of Software Pipelining for the MIPS R8000 Processor"
Artour Stoutchinin

CAPSL Technical Memo 06:
"A New Fast Algorithm for Optimal Register Allocation in Modulo Scheduled Loops"
Sylvain Lelait, Guang R. Gao and Christine Eisenbeis

CAPSL Technical Memo 05:
"Design and Evaluation of Dynamic Load Balancing Schemes under A Multithreaded Execution Model"
Haiying Cai, Olivier Maquelin and Guang R. Gao

CAPSL Technical Memo 04:
"Non-Clustered Statistical Trace Sampling for Large Cache Design Space Exploration"
Darren E. Vengroff, Kenneth Simpson and Guang R. Gao

CAPSL Technical Memo 03:
"Thread Partitioning and Scheduling Based on Cost Model"
Xinan Tang, Jian Wang, Kevin B. Theobald and Guang R. Gao
April 15, 1997

CAPSL Technical Memo 02:
"Elastic History Buffer: A Low-Cost Method to Improve Branch Prediction Accuracy"
Maria-Dana Tarlescu, Kevin B. Theobald, Guang R. Gao
November 14, 1996

CAPSL Technical Memo 01:
"Hybrid Technology Multithreaded Architecture"
Guang R. Gao, Konstantin K. Likharev, Paul C. Messina and Thomas L. Sterling


 

 

Technical Notes

CAPSL Technical Note 19:
"Mrs. Clops Tool Chain Manual"
Matthew Wells
March, 2006

CAPSL Technical Note 18:
"ASAP Low-Level Connection Library"
Inanc Dogru
March, 2006

CAPSL Technical Note 17:
"C64 DDR Verification and Critical Path Reduction"
Michael Bodnar
September, 2005

CAPSL Technical Note 16:
"The Cyclops-E Emulation Environment"
Juan del Cuvillo and Nathaniel Merritt.
August, 2005

CAPSL Technical Note 15:
"SLICED: a Source Level Interacting Cyclops-64 Effective Debugger"
Geoff Gerfin and Ziang Hu, Ph. D.
August 26, 2004

CAPSL Technical Note 14:
"DISC64: A Disassembler for the Instruction Set of Cyclops-64"
John Tully
August 5, 2004

CAPSL Technical Note 13:
"Generate the Multiple and Add Operation during the WHIRL Lowering Phase
Joseph Bryant Manzano Franco and Haiping Wu
May 31, 2004

CAPSL Technical Note 12:
"Integrate EBO with Pattern Matching"
Divya Parthasarathi
May 28, 2004

CAPSL Technical Note 11:
"A DIMES Demonstration Application: Mandelbot-Set Generation Using a Work-Stealing Algorithm"
Jason M. McGuiness
June 15, 2002

CAPSL Technical Note 10 Revised:
"A Software Development Kit for CeDIMES"
Juan del Cuvillo, Robert Klosiewicz and Yingping Zhang
March 15, 2005

CAPSL Technical Note 10:
"A Software Development Kit for CeDIMES"
Juan del Cuvillo, Robert Klosiewicz and Yingping Zhang
September 30, 2002

CAPSL Technical Note 09:
"Threaded-C Release 2.0: Motivation, Description, and Rationale"
Guy Tremblay
June 15, 2000

CAPSL Technical Note 08:
"Runtime Locality Transformations for NAS Conjugate Gradient (Sparse Matrix Computation)"
Rishi Kumar, Nathaniel Johnson, Ruppa K. Thulasiram, Gagan Agrawal, Guang R. Gao
December 17, 1999

CAPSL Technical Note 07:
"Computational Financial Derivatives ---A Primer"
Ruppa K. Thulasiram, Guang R. Gao
October 9, 1998

CAPSL Technical Note 06:
"Debugging: The `Feedback' Way"
James P. Durbano
October 9, 1998

CAPSL Technical Note 05:
"Portable Threaded-C Release 1.1"
Jos'e Nelson Amaral, Zachary Ruiz, Sean Ryan, Andres Marquez, Christopher Morrone, Prasad Kakulavarapu, Guang R. Gao
October 8, 1998

CAPSL Technical Note 04:
"Implementation of I-Structures as a Library of Functions in Portable Threaded-C"
Jos'e Nelson Amaral, Guang R. Gao
June 15, 1998

CAPSL Technical Note 03:
"Proposed Changes to Threaded-C"
Kevin B. Theobald
January 20, 1998

CAPSL Technical Note 02:
"A Portable Threaded-C Language for EARTH Multiprocessors"
Xinan Tang, Olivier Maquelin, Kevin B. Theobald, Guang R. Gao, Prasad Kakulavarapu
January 6, 1998

CAPSL Technical Note 01:
"An Overview of the Threaded-C Language"
Guang R. Gao, Xinan Tang, Parimala Thulasiraman, Kevin B. Theobald
July 25, 1997


 

 

CAPSL Theses

  Ph.D. Theses

  Masters Theses

 

Ph.D. Theses:

"Advanced Protein Sequence Analysis Methods for Structure and Function Prediction"
Robel Y. Kahsay
Spring 2005

"The CARE Architecture"
Andrés Marquez
Winter 2004

"Power-Aware Compilation Techniques for High Performance Processors"
Hongbo Yang
Fall 2003

"Irregular Computations on Fine-Grain Multithreaded Architecture"
Parimala Thulasiraman
Fall 2000

"Compiling for Multithreaded Architectures"
Xinan Tang
Fall 1999

"EARTH: An Efficient Architecture for Running Threads"
Kevin Bryan Theobald
Spring 1999

 

Masters Theses:

"Optimizing the Fast Fourier Transform on a Many core Architecture"
Long Chen
Winter 2008

"Design and Implementation of Tool-chain framework to support OpenMP Single Source Compilation on CELL platform"
Yi Jiang
Winter 2007

"Methodology of Dynamic Compiler Option Selection Based on Static Program Analysis - Implementation and Evaluation"
Eun Jung Park
Summer 2007

"Efficient Mapping of Fast Fourier Transform on the Cyclops-64 Multithreaded Architecture"
Liping Xue
Summer 2007

"Tower Methodology for Verification of Multi-Core Architecture - A Case Study"
Divya Parthasarathi
Summer 2005

"A Study of Architecture and Performance of IBM Cyclops-64 Interconnection Network"
Yingping Zhang
Summer 2005

"Quantitive Study of Human-Computer interaction in adaptive search on Mobile Handsets and its Localization for Mandarin Chinesse"
Xing Wang
Fall 2004

"A Parallel Debugger for the Cyclops Architecture"
Robert S. Klosiewic Jr.
Summer 2004

"Multithreaded Parallel Implementation of HPMMPFAM on EARTH"
Weirong Zhu
Spring 2004

"Implementing Parallel CG Algorithm on the EARTH Multithreaded Architecture"
Fei Chen
Spring 2004

"Code Size Oriented Memory Allocation for Temporary Variables"
Yan Xie
Winter 2004

"Binary Diffing"
Kapil Khosla
Fall 2003

"A Portable Runtime System and its Derivation for the Hardware SU Implementation"
Chuan Shen
Fall 2003

"A Interconnect Architecture for Commodity Off-the-thelf Multiprocessor Emulation Testbed"
Mark Lawrence Legutko
Spring 2002

"A Visual Perspective to Motif/Pattern Analysis"
Praveen R Thiagarajan
Summer 2001

"Automated Single Nucleotide Ploymorphism Discovery Pipeline"
Francisco Jose Useche Gomez
Summer 2001

"Efficient Parallelization of Reductions and Loop Based Programs on EARTH"
Rishi Kumar
Summer 2001

"Whole Genome Comparison Using A Multithreaded Parallel Implementation"
Juan Del Cuvillo
Summer 2001

"A EARTH Runtime System For Multi-Processor/Multi-Node Beowulf Cluster"
Christopher Jason Morrone
Spring 2001

"Implementation Issues of a Hardware-Based EARTH Synchronization Unit"
Thomas Geiger
Spring 2001

"Register Stack and Optimal Allocation Instruction Placement"
Alban Douillet
Spring 2001

"Advanced Compilers, Architectures and Parallel Systems"
ShaoHua Han
Spring 2001

"Dynamic Load Balancing Issues in the EARTH Runtime System"
Kamala Prasade Kakulavarapu
Fall 1999

"Towards a Custom EARTH Synchronization Unit"
Ian Stuart MacKenzie Walker
Summer 1999

"Static Instruction Schedule For Dynamic Issue Processor"
Raul E. Silvera Muñoz
Spring 1997

© CAPSL 1996-2007. All Rights Reserved.