Publications

Refereed Journal/Conference papers

(*=students or former students)

  • Shuaiwen Song, Chun-yi Su, Barry Rountree, Kirk W. Cameron, “A Simplified and Accurate Model of Power-Performance Efficiency on Emergent GPU Architectures”, full paper accepted by 27th IEEE International Parallel & Distributed Processing Symposium (IPDPS), Boston, 2013.(link)
  • Hung-Ching Chang, Abhishek R Agrawal, and Kirk W. Cameron, “Energy-Aware Computing for Android Platforms”, Intel Technology Journal on Energy and Sustainability, 2012. (to appear). (link)
  • Hung-Ching Chang, Erik Kruus, Tomas J Barnes, Abhishek R Agrawal, and Kirk W. Cameron, “Storage Power Optimizations for Client Devices and Data Centers”, Intel Technology Journal on Energy and Sustainability, 2012. (to appear) (link)
  • C. Su, D. Li, D. Nikolopoulos, M. Grove, K. Cameron, B. de Supinski, Critical Path-Based Thread Placement for NUMA Systems, ACM SIGMETRICS Performance Evaluation Review, 40(2) 2012. (link)
  • C. Su, D. Li, D. Nikolopoulos, K. Cameron, B. de Supinski, E.A. Leon. ” Model-Based, Memory-Centric Performance and Power Optimization on NUMA Multiprocessors”. IISWC 2012. (link)
  • *S.Song, Kirk W. Cameron, “System-Level Power-Performance Efficiency Modeling for Emergent GPU Architectures”, accepted abstract in proceedings of THE 21ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT’12), Minneapolis, MN. (link)
  • *Bo Li, S. Song, Ivona Bezakova, Kirk W. Cameron, “Energy-Aware Replica Selection for Data-Intensive Services in Cloud”, short paper, to appear in IEEE International Symposium on Modeling , Analysis, and Simulation of Computer and Telecommunication System (MASCOTS’12), Washington DC. (link)
  • Dong Li, Bronis de Supinski, Martin Schulz, Dimitrios Nikolopoulos, and Kirk Cameron. “Strategies for Energy Efficient Resource Management of Hybrid Programming Models”. IEEE Transaction on Parallel and Distributed Systems. To appear, 2012. (link)
  • *S. Song, M. Grove and K.W. Cameron, An iso-energy-efficient approach to scalable system power-performance optimization, in proceedings of the IEEE International Conference on Cluster Computing (Cluster 2011), Austin, Texas, September 2011. (link)
  • Charles W. Lively, Xingfu Wu, Valerie E. Taylor, Shirley Moore, Hung-Ching Chang, Kirk W. Cameron: Energy and performance characteristics of different parallel implementations of scientific applications on multicore systems. IJHPCA 25(3): 342-350 (2011) (link)
  • C. Lively, X. Wu, V. Taylor, S. Moore, *H. Chang, *C. Su and K.W. Cameron. Power-Aware Predictive Models of Hybrid (MPI/OpenMP) Scientific Applications, International Conference on Energy-Aware High Performance Computing (EnA-HPC 2011), September 07–09, 2011. (link)
  • D. Li, D. Nikolopoulos, K.W. Cameron, B. R. de Supinski and M. Schulz. Scalable Memory Registration for High-Performance Networks Using Helper Threads. In Proceedings of ACM International Conference on Computer Frontier (CF), 2011. (link)
  • Vishnu A, S. Song, A Marquez, K.J. Barker, D.J. Kerbyson, Kirk. W. Cameron and P. Balaji, “Designing Energy Efficient Communication Runtime System: A View from PGAS Models”, Journal of Supercomputing (JOS’11), Springer, 2011. (link)
  • *S. Song, *C.-Y. Su, *R. Ge, A. Vishnu, and K.W. Cameron, Iso-energy-efficiency: An approach to power-constrained parallel computation, Proceedings of 25th IEEE International Parallel and Distributed Processing Symposium (IPDPS 11), 12 pages, May 2011. (link)
  • A. Vishnu, *S. Song, A. Marquez, K. Barker, D. Kerbyson, K.W. Cameron, P. Balaji, Designing Energy Efficient Communication Runtime Systems for Data Centric Programming Models, IEEE/ACM International Conference on Green Computing and Communications (GreenCom 2010), Hangzhou, China, 12 pages, December 2010.(link)
  • *D. Li, *R. Ge, and K.W. Cameron, System-level, Unified In-band and Out-of-band Dynamic Thermal Control, proceedings of 2010 International Conference on Parallel Processing (ICPP 2010), 10 pages, September 2010. (link)
  • Vishnu A, H van Dam, WA De Jong, P. Balaji and S. Song, “Fault Tolerant Communication Runtime Support for Data Centric Programming Models”, in proceedings of International Conference on High Performance Computing (HiPC 2010), India. (link)
  • *Z. Cao, *D. R. Easterling, L. T. Watson, *D. Li, K. W. Cameron, and W.-C. Feng, Power saving experiments for large scale global optimization”, International Journal of Parallel Emergent Distributed Systems, 25(4): pp. 381-400, 2010.
  • *D. Li, B. R. de Supinski, M. Schulz, K. W. Cameron, D. S. Nikolopoulos, Hybrid MPI/OpenMP Power-Aware Computing. Proceedings of 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS 10), 12 pages, April 2010. (link)
  • *D. Li, D. Nikolopoulos, K. W. Cameron, B. R. de Supinski, M. Schulz, Power-aware MPI Task Aggregation Prediction for High-End Computing Systems. Proceedings of 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS 10), April 2010. (link)
  • *Ge, R., *Feng, X., *Song, S., *Chang, H-C., *Li, D., Cameron, K.W., PowerPack: Energy Profiling and Analysis of High-Performance Systems and Applications. IEEE Transactions on Parallel and Distributed Systems, IEEE Computer Society, 21(5): 658-671, 2010. (link)
  • *Z. Cao, L. T. Watson, K. W. Cameron, *R. Ge: A power aware study for VTDIRECT95 using DVFS. Proceedings of SpringSim 2009.
  • *Song, S., *Ge, R., *Feng, X., Cameron, K.W., Energy profiling and analysis of the HPC Challenge Benchmarks, International Journal of High Performance Computing Applications (IJHPCA’09), Sage Publications, New York, 2009, 23(3): 265-276, 2009. (link)
  • *M. Tolentino, *J. Turner, and K.W. Cameron, Memory-MISER: Improving Main Memory Energy Efficiency in Servers. IEEE Transactions on Computers, IEEE Computer Society Press, vol. 58, no. 3, pp. 336-350, 2008.
  • *D. Li, *S. Huang, and K. W. Cameron, CG-Cell: An NPB Benchmark Implementation on Cell Broadband Engine. Theoretical Computer Science, M. Conti, P. Das, N. Santoro (Eds.), Elsevier, Amsterdam, 2008.
  • *D. Li, *S. Huang, and K. W. Cameron, CG-Cell: An NPB Benchmark Implementation on Cell Broadband Engine, proceedings of International Conference on Distributed Computing and Networking (ICDCN 2008), Kolkata, India, January 2008.
  • *Filip Blagojevic, *Xizhou Feng, Kirk W. Cameron, and Dimitris Nikolopoulos, “Modeling Mulitigrain Parallelism on Heterogeneous Multicore Processors: A Case Study of the Cell BE,” proceedings of International Conference on High Performance Embedded Architectures & Compilers (HiPEAC 2008), Goteberg, Sweden, January 2008.
  • W. Feng and K.W. Cameron, “The Green500 List: Encouraging Sustainable Supercomputing”, IEEE Computer, Volume 40, Number 12, 2007.
  • K.W. Cameron, *R. Ge, and X.-H. Sun, “lognP and log3P: Accurate analytical models of point-to-point communication in distributed systems”, IEEE Transactions on Computers, Volume 56, Number 3, 2007.
  • K. W. Cameron; *H. K. Pyla; and S. Varadarajan, “Tempest: A portable tool to identify hot spots in parallel code,” proceedings of 2007 International Conference on Parallel Processing (ICPP 07), Xi An, China, September 2007.
  • *R. Ge; *X. Feng; W. Feng; and K. W. Cameron , “CPU Miser: A performance-Directed, Run-Time System for Power-aware Clusters,” proceedings of 2007 International Conference on Parallel Processing (ICPP 07), Xi An, China, September 2007.
  • *M. Tolentino, *J. Turner, and K.W. Cameron, “Memory-MISER: A performance-constrained runtime system for power-scalable clusters”, Proceedings of ACM International Conference on Computing Frontiers, May 2007.
  • *R. Ge, and K. W. Cameron, “Power-Aware Speedup”, Proceedings of the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 07), March 2007.
  • *X. Feng, K. W. Cameron, B. Smith, and C. Sosa, “Building the Tree of Life on Tera-scale Systems,” Proceedings of the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 07), March 2007.
  • K. W. Cameron, *X. Feng and *R. Ge, “The Argus Prototype: Aggregate Use of Load Modules as a High density Supercomputer,” Concurrency and Computation: Practice and Experience, Volume 18, Issue 1, 2006.
  • *X. Feng, K.W. Cameron and D.A. Buell, “High Performance, Bayesian-based Phylogenetic Inference Framework”, Proceedings of the 18th IEEE/ACM High Performance Computing, Networking and Storage Conference (SC), 2006.
  • K. W. Cameron, *R. Ge, *X. Feng, “High-Performance, Power-Aware Distributed Computing for Scientific Applications,” IEEE Computer, Volume 38, Issue 11, November 2005.
  • S. Byna, K. W. Cameron and X.-H. Sun, “Memory-Aware Communication -An Experimental Study with MPI,” Parallel Processing Letters, Volume 15, Issue 4, pp 357-365, December 2005.
  • K. W. Cameron, *X. Feng and *R. Ge, “Performance-constrained Distributed DVS Scheduling for Scientific Applications on Power-aware Clusters,” Proceedings of the 17th IEEE/ACM High Performance Computing, Networking and Storage Conference (SC 2005), 15 pgs, November 2005.
  • *X. Feng, *R. Ge, and K. W. Cameron, “Power and Energy Profiling of Scientific Applications on Distributed Systems,” Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS 05), 10 pgs, April 2005.
  • *X. Feng, *R. Ge, and K. W. Cameron, “ARGUS: Supercomputing in 1/10 Cubic Meter,” Proceedings of the International Conference on Parallel and Distributed Computing and Networks (PDCN 2005), February 2005.
  • K. W. Cameron, and *R. Ge, “Predicting and Evaluating Distributed Communication Performance,” Proceedings of the 16th IEEE/ACM International Conference on High Performance Computing and Communications (SC 2004), Nov 2004.
  • K. W. Cameron and X.-H. Sun, “Quantifying Locality Effect in Data Access Delay: Memory logP,” Proceedings of the 17th IEEE International Parallel and Distributed Processing Symposium (IPDPS 03), 8 pgs, April 2003.
  • *M.T. Maxwell and K.W. Cameron, “Optimizing Application Performance: A Case Study Using LMBench,” ACM Crossroads, 8(5), September 2002.
  • Y. Solihin, K. W. Cameron, Y. Luo, D. Lavenier, and M. Gokhale, “Mutable Functional Units and Their Applications on Microprocessors,” Proceedings of the International Conference on Computer Design 2001 (ICCD 2001), pp 234-239, September 2001.
  • X.-H. Sun, K. W. Cameron, D. He, and Y. Luo, “Adaptive Multivariate Regression for Advanced Memory System Evaluation,” Journal of Performance Evaluation, Volume 45, Issue 1, Pages 1-18, May 2001.
  • D. Lavenier, K. W. Cameron, Y. Solihin, “Integer/Floating Point Reconfigurable ALU,” Proceedings of the 6th Symposium on New Machine Architectures (SympA’6), 12 pgs, June 2000.
  • X.-H. Sun, and K. W. Cameron, “A Statistical-Empirical Hybrid Approach to Hierarchical Memory Analysis,” Proceedings of Euro-Par 2000, pp 141-148, August 2000.
  • K. W. Cameron, and Y. Luo, “Instruction-level Microprocessor Modeling of Scientific Applications,” Proceedings of the Second International Symposium on High Performance Computing (ISHPC 99), pp. 29-41, May 1999.
  • X.-H. Sun, D. He, K. W. Cameron, and Y. Luo, “A Factorial Performance Evaluation for Hierarchical Memory Systems,” Proceedings of the 13th International Parallel Processing Symposium (IPPS/SPDP 99), pp. 70-74, April 1999.
  • X.-H. Sun, K. W. Cameron, D. He, and Y. Luo, “A Memory-Centric Characterization of ASCI Applications Via A Combined Approach of Statistical and Empirical Analysis,” Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing (PPSC 1999), March 1999.

Refereed Workshops

  • S. Shrestha, C. Su, J. B. Manzano, A. White, A. Marquez, K. W. Cameron, G. R. Gao, “MODA: A Framework for Memory Centric Performance Characterization,” WHIST 2012. (link)
  • C. Su, D. Li, D. Nikolopoulos, M. Grove, K. Cameron, B. de Supinski, Critical Path-Based Thread Placement for NUMA Systems, in proceedings of the 2nd International Workshop on Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS11) at Supercomputing 2011, Seattle, Washington, November 2011. (link)
  • R. Ge, X. Feng, and K. W. Cameron, “ Modeling and Evaluating Energy-Performance Efficiency of Parallel Processing on Multicore based Power Aware Systems,” Proceedings of the 5th Workshop on High-performance, Power-aware Computing (HPPAC), Rome, Italy, 8 pages, 2009.
  • D. Li, H. Pyla, and K. W. Cameron, “System-level, Thermal-aware, Fully-loaded Processor Scheduling,” Proceedings of the 4th Workshop on High-performance, Power-aware Computing (HPPAC), Miami, FL, 8 pages, 2008.
  • D. Nikolopoulos and K. W. Cameron, “Synthesizing Parallel Programming Models for Asymmetric Multi-Core Systems,” Proceedings of the 11th Workshop on High Performance Embedded Computing, MIT Lincoln Lab, 2007.
  • M. Tolentino, J. Turner, and K.W. Cameron, “An Implementation of Page Allocation Shaping for Energy Efficiency,” Proceedings of the 3rd Workshop on High-performance, Power-aware Computing (HPPAC) 2007.
  • R. Ge, X. Feng, and K. W. Cameron , “Improvement of Power-Performance Efficiency for High-End Computing,” Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS 05) – HPPAC ’05 Workshop, Denver, CO, April 2005.
  • S. Byna, K. W. Cameron and X.-H. Sun, “Memory-Aware Communication -An Experimental Study with MPI,” Proceedings of the 1st International Workshop on Hardware/Software Support for Parallel and Distributed Scientific and Engineering Computing (SPDSEC02), 10 pgs, September 2002.
  • Y. Luo, K. W. Cameron, and O. Lubeck, Instruction-level Characterization of Computational Physics and Multimedia Applications Using Performance Counters. Proceedings of 2nd Workshop on Computer Architecture Evaluation Using Commercial Workloads (CAECW), pp. 12, 1999.
  • Y. Luo, O. M. Lubeck, H. Wasserman, F. Bassetti and K. W. Cameron, “Development and Validation of a Hierarchical Memory Model Incorporating CPU- and Memory-operation Overlap,” Proceedings of the 1st International Workshop on Software and Performance (WOSP ’98), pp. 152-163, October 1998.
  • O. Lubeck, A. Hoisie, F. Bassetti, K. W. Cameron, Y. Luo, and H. Wasserman, “ASCI Application Performance and the Impact of Commodity Processor Architectural Trends,” Proceedings of the International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems, October 1998.

Book Contributions

  • S. Song* and K.W.Cameron, “Green Computing at Scale,” Harnessing Green IT: Principals and Practice, 2011, (in press).
  • R. Ge* and K.W. Cameron, “Power-aware, High-Performance Computing,” Green Computing, Wiley & Sons, 2011, (in press).
  • NSF Report on the Science of Power Management, Eds. K. Cameron and K. Pruhs, pp. 37, August 2010. (Technical Report No. VT/CS-09-19).
  • K.W. Cameron, R. Ge, and X. Feng, “Designing Computational Clusters for Performance and Power,” Advances in Computers, Elsevier Science BV, Amsterdam, 2007. (invited)
  • S. Byna, K. W. Cameron, and X.-H. Sun, “Quantification of Memory Communication,” High Performance Scientific and Engineering Computing: Hardware/Software Support, Kluwer Academic Publishers, Boston, MA, (2004) pp. 31-44.
  • S. Ashby, D. H. Bailey, M. Blackmon, P. Bohrer, K. Cameron, C. DeTar (U. Utah), J. Dongarra, D. Dwoyer, P. Freeman, A. Gheith, B. Gorda, G. Hammer, W. Felter, J. Kepner, D. Koester, S. McKee, D. Nelson, J. Nichols, M. Vahle, J. Vetter, T. Windus, P. Worley, “Performance Modeling, Metrics and Specifications,” Workshop on The Roadmap for the Revitalization of High-End Computing, Computer Research Association, Washington, DC, (2003) pp. 59-68. (invited)
  • X.-H. Sun, and K. W. Cameron, “A Statistical-Empirical Hybrid Approach to Hierarchical Memory Analysis,” Lecture Notes in Computer Science 1900. Springer Verlag Publishers, New York, NY, (2000) pp. 141-148. (from EuroPAR 2000)
  • K. W. Cameron, and Y. Luo, “Instruction-level Microprocessor Modeling of Scientific Applications,” Lecture Notes in Computer Science 1615, Springer Verlag Publishers, New York, NY, (1999) pp. 29-41. (from ISHPC 99)
  • Y. Luo and K. W. Cameron, “Instruction-level Characterization of Scientific Computing Applications Using Hardware Performance Counters,” Scientific, Engineering and Desktop Workloads of Workload Characterization: Methodology and Case Studies, IEEE-CS Press, Los Alamitos, CA, (1999) pp. 90-98.