CASC Home
     LLNL : Computation : CAR : CASC

CASC Home

Home

History

Groups

Projects

Software

Publications

Postdoctoral Program

Summer Program

Job Opportunities

Organization

Sponsors

Collaborators

Martin Schulz


Email: schulz6@llnl.gov
Phone: 925-423-6498

Mailing address:
Martin Schulz
Lawrence Livermore National Laboratory
Box 808, L-560
Livermore, CA 94551-0808


Martin is a Computer Scientist at the Center for Applied Scientific Computing (CASC) at Lawrence Livermore National Laboratory (LLNL). He earned his Doctorate in Computer Science in 2001 from the Technische Universität München (Munich, Germany). He also holds a Master of Science in Computer Science from the University of Illinois at Urbana Champaign. After completing his graduate studies and a postdoctoral appointment in Munich, he worked for two years as a Research Associate at Cornell University, before joining LLNL in 2004.

Martin's research interests include parallel and distributed architectures and applications; performance monitoring, modeling and analysis; memory system optimization; parallel programming paradigms; tool support for parallel programming; power efficiency for parallel systems; optimizing parallel and distributed I/O; and fault tolerance at the application and system level. In his position at LLNL he especially focuses on the issue of scalability for parallel applications, code correctness tools, and parallel performance analyzer as well as scalable tool infrastructures to support these efforts.

Martin is a member of LLNL's ASC CSSE ADEPT (Application Development Environment and Performance Team) and he works closely with colleagues in CASC's Computer Science Group (CSG) and in the Development Environment Group (DEG). He is also the PI for the ASC/CCE project on Open|SpeedShop and the LLNL PI for the OASCR PetaTools project on "Building a Community Tool Infrastructure around Open|SpeedShop".

Current Projects include:

  • Open|SpeedShop
    Open source performance tools for Linux clusters
  • CBTF - Component Based Tools Framework
    Modular component architecture for scalable performance analysis and debugging
  • P^nMPI (available on github) and Dynasty
    Dynamic instrumentation and tool stacking infrastructure
  • PAVE - Performance Analysis and Visualization for Exascale
    Analysis, interpretation, and correlation of performance data in intuitive domains
  • Scalable support for runtime tools
    Includes the Stack Trace Analysis Tool (STAT) and the tracing framework ScalaTrace.
  • Performance Analysis, Optimization, and Modeling of Algebraic Multigrid Methods
    In close collaboration with the hypre team
  • Working Group Lead for Tools in the MPI Forum
    MPIT: A new Performance Information Interface for MPI-3
  • Power adaptive runtime systems for parallel environments
    Reducing power consumption of parallel systems without sacrificing performance
  • Load balance analysis and optimization
    Scalable techniques to understand, model, and reduce load imbalances in parallel codes
  • Techniques for the Exploration of Parameter Spaces
    using regression and machine learning approaches
  • eGprof
    Differential profiling transparently integrated into gprof

Selected Conference Involvements

  • Area Chair for Performance, SC 2011
  • Subject Area Editor, Journal for Parallel Computing – Systems and Applications
  • General Chair, PACT 2009
  • Steering Committee of PACT
  • Vice chair for Architecture, HiPC 2008
  • Vice chair, EuroPar 2008; Advisory Board of EuroPar 2009/2010
  • Vice program chair for “Environments and Tools for Software Development” at HPCC 2006 and 2007
  • Finance and Registration Chair, PPoPP 2009
  • Workshops and Tutorial Chair, IEEE Cluster 2010
  • Program Committee of Supercomputing 2007/2010-2011 (papers) & 2008/09 (tutorials); IPDPS 2007-2009/2011; ICS 2009,2011; CCGrid 2011; CF 2011; ICPP 2007/2009/2011; ParCo 2007-2010; HPCC 2006-2009; EuroPVM/MPI 2005-2010; IGCC 2010; IEEE Cluster 2010
  • HIPS: Workshop on High-Level Parallel Programming Models and Supportive Environments (Steering Committee since 2001, Chair 2000, Co-Chair 2003 & 2008, Program Committee 2001-2002,2004-2006,2009-2011)
  • ParSim: Current Trends in Numerical Simulation for Parallel Engineering Environments (Co-Organizer since 2002)
  • SCI-Europe: International Conference on SCI-based Technology and Research (Steering Committee 2000-2001, Co-Chair 2001, Program Committee 2000-2001)

Publications

Journals

  1. M. Noeth, P. Ratn, F. Mueller, M. Schulz and B.R. de Supinski, "ScalaTrace: Scalable Compression and Replay of Communication Traces in Massively Parallel Environments," Journal of Parallel and Distributed Computing (JPDC), To appear. (LLNL-JRNL-403992).
  2. M. Schulz, J. Galarowicz, D. Maghrak, W. Hachfeld, D. Montoya, S. Cranford, “Open|SpeedShop: An Open Source Infrastructure for Parallel Performance Analysis”, Scientific Programming, Vol. 16, Number 2-3 / 2008, pp. 105-121, IOS Press, LLNL-JRNL-234840.
  3. R. Preissl, M. Schulz, D. Kranzlmueller, B.R. de Supinski and D. Quinlan, “Transforming MPI Source Code based on Communication Patterns”, Future Generation Computer Systems, Vol. 16, Number 1 / 2010, pp. 147-154, LLNL-JRNL-408081.
  4. B. de Supinski, M. Schulz, V.V. Bulatov, W. Cabot, B. Chan, A.W. Cook, E.W. Draeger, J.N. Glosli, J.A. Greenough, K. Henderson, A. Kubota, S. Louis, B.J. Miller, M.V. Patel, T.E. Spelce, F.H. Streitz, P.L. Williams, R.K. Yates, A. Yoo, G. Almasi, G. Bhanot, A. Gara, J.A. Gunnels, M. Gupta, J. Moreira, J. Sexton, B. Walkup, C. Archer, F. Gygi, T.C. Germann, K. Kadau, P.S. Lomdahl, C. Rendleman, M.L. Welcome, W. McLendon, B. Hendrickson, F. Franchetti, S. Kral. J. Lorenz, C.W. Überhuber, E. Chow and U. Catalyurek, “BlueGene/L Applications: Parallelism on a Massive Scale,” to appear in The International Journal of High Performance Computing Applications, UCRL-JRNL-224370.
  5. G. Lee, M. Schulz, D. Ahn, A. Bernat, B.R. de Supinski, S. Ko, and B. Rountree, "Dynamic Binary Instrumentation and Data Aggregation on Large Scale Systems," to appear in International Journal of Parallel Programming, UCRL-JRNL- 226801.
  6. E. Ipek, S.A. McKee, K. Singh, R. Caruana, B.R. de Supinski and M. Schulz, "Efficient Architectural Design Space Exploration via Predictive Modeling," to appear in ACM Transactions on Architecture and Code Optimization, UCRL-JRNL-227222.
  7. K. Singh, E. Ipek, S.A. McKee, B.R. de Supinski, M. Schulz and R. Caruana, "Predicting Parallel Application Performance via Machine Learning Approaches," to appear in Concurrency and Computation: Practice & Experience, UCRL-JRNL-222444.
  8. T. Brandes, H. Schwamborn, M. Gerndt, J. Jeitner, E. Kereku, W. Karl, M. Schulz, J. Tao, H. Brunst, W. Nagel, R. Neumann, R. Mueller-Pfefferkorn, B. Trenkler, and H.-C. Hoppe, “Monitoring Cache Behavior on Parallel SMP Architectures and Related Programming Tools”, Future Generation Computer Systems (FGCS), Vol.21, Nr. 8, October 2005, pp. 1298-1311.
  9. J. Tao, M. Schulz, and W. Karl, “Simulation as a Tool for Optimizing Memory Accesses on NUMA Machines”, Performance Evaluation. Vol.60, No.1-4, May 2005, pp.31-50.
  10. J. Tao, M. Schulz, and W. Karl, “Simulation as a Tool for Optimizing Memory Accesses on NUMA Machines”, Performance Evaluation, Future Generation Computer Systems. Vol.19, No.5, 2003, pp.761-776.
  11. J. Tao, M. Schulz, and W. Karl, “ARS: An Adaptive Runtime System for Locality Optimizations”, Future Generation Computer Systems (FGCS), Vol 19, No. 5, 2003, pp. 761-776.
  12. M. Schulz, J. Tao, C. Trinitis, and W. Karl, “SMiLE: An Integrated, Multi-Paradigm Software Infrastructure for SCI-based Clusters”, Future Generation Computer Systems (FGCS), Vol. 19, No. 4, pp.521-532, (Special issue with best papers of CCGrid02), 2003.
  13. G. Torralba, V. González, E. Sanchis, J. Tao, M. Schulz, and W. Karl, “Data monitoring in high-performance clusters for computing applications”, IEEE Transactions on Nuclear Science, Vol. 49, No. 2, April 2002.
  14. J. Tao, W. Karl, and M. Schulz, “Memory Access Behavior Analysis of NUMA-based Shared Memory Programs”, Scientific Computing, Special issue on Performance-Oriented Application Development for Distributed Architectures.

Conferences

  1. Anh Vo, Sriram Aananthakrishnan, Ganesh Gopalakrishnan, Bronis R. de Supinski, Martin Schulz and Greg Bronevetsky, "A Scalable and Distributed Dynamic Formal Verifier for MPI Programs," SC2010, New Orleans, Louisiana, November 13–19, 2010. (LLNL-CONF-2475*).
  2. Robert Preissl, Bronis R. de Supinski, Martin Schulz, Daniel J. Quinlan, Dieter Kranzlmueller and Thomas Panas, “Exploitation of Dynamic Communication Patterns through Static Analysis,” 2010 International Conference on Parallel Processing (ICPP-10), San Diego, CA, September 13-16, 2010. (LLNL-CONF-438991*).
  3. Karan Singh, Matthew Curtis-Maury, Sally A. McKee, Filip Blagojevic, Dimitris S. Nikolopoulos, Bronis R. de Supinski and Martin Schulz, “Comparing Scalability Prediction Strategies on an SMP of CMPs,” Euro-Par 2010, Naples, Italy, August 31–September 3, 2010. (LLNL-CONF-423717*).
  4. Greg Bronevetsky, Ignacio Laguna, Saurabh Bagchi, Bronis R. de Supinski, Dong H. Ahn and Martin Schulz, "AutomaDeD: Automata-Based Debugging for Dissimilar Parallel Tasks," 40th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2010), Chicago, IL, June 28 – July 1, 2010. (LLNL-CONF-426270*).
  5. Allison Baker, Martin Schulz, and Ulrike Yang, “On the Performance of an Algebraic Multigrid Solver on Multicore Clusters”, VecPar 2010, June 2010.
  6. Frank Mueller, Xing Wu, Martin Schulz, Todd Gamblin and Bronis R. de Supinski, "ScalaTrace: Tracing, Analysis and Modeling of HPC Codes at Scale," Para 2010: State of the Art in Scientific and Parallel Computing, Reykjavík, Iceland, June 6-9, 2010. (LLNL-CONF-427005*).
  7. Todd Gamblin, Bronis R. de Supinski, Martin Schulz, Robert J. Fowler and Daniel A. Reed, "Clustering Performance Data Efficiently at Massive Scales," Twenty Fourth International Conference on Supercomputing (ICS 2010), Tsukuba, Japan, June 1–4, 2010. (LLNL-CONF-422684*).
  8. Bradley Barnes, Jeonifer Garren, David K. Lowenthal, Jaxk Reeves, Bronis R. de Supinski, Martin Schulz and Barry Rountree, "Using Focused Regression for Accurate Time-Constrained Scaling of Scientific Applications," Twenty Fourth International Parallel and Distributed Processing Symposium (IPDPS 2010), Atlanta, GA, April 19–23, 2010. (LLNL-CONF-422989*).
  9. Dong Li, Bronis R. de Supinski, Martin Schulz, Kirk Cameron and Dimitrios S. Nikolopoulos, "Power-aware MPI Task Aggregation Prediction for High-End Computing Systems," Twenty Fourth International Parallel and Distributed Processing Symposium (IPDPS 2010), Atlanta, GA, April 19–23, 2010. (LLNL-CONF-422991*).
  10. Dong Li, Dimitrios S. Nikolopoulos, Kirk Cameron, Bronis R. de Supinski, and Martin Schulz, "Hybrid MPI/OpenMP Power-Aware Computing," Twenty Fourth International Parallel and Distributed Processing Symposium (IPDPS 2010), Atlanta, GA, April 19–23, 2010. (LLNL-CONF-422990*).
  11. D. Ahn, B.R. de Supinski, I. Laguna, G.L. Lee, B. Liblit, B.P. Miller, M. Schulz, “Scalable Temporal Order Analysis for Large Scale Debugging”,SC2009, Portland, Oregon, November, 2009. (LLNL-CONF- 412227).
  12. B. de Supinski, Sadaf Alam, D.H. Bailey, L. Carrington, C. Daley, A. Dubey, T. Gamblin, D. Gunter, P.D. Hovland, H. Jagode, K. Karavanic, G. Marin, J. Mellor-Crummey, S. Moore, B. Norris, L. Oliker, C. Olschanowsky, P.C. Roth, M. Schulz, S. Shende, A. Snavely, W. Spear, M. Tikir, J. Vetter, P. Worley, and N. Wright, “Modeling the Office of Science ten year facilities plan: The PERI Architecture Tiger Team”, SciDAC 2009, (LLNL-CONF-413427).
  13. B. Rountree, D.K. Lowenthal, B.R. de Supinski, M. Schulz, V.W. Freeh and T. Bletsch, “Adagio: Making DVS Practical for Complex HPC Applications”, Twenty Third International Conference on Supercomputing (ICS 2009), (LLNL-CONF-412083).
  14. M. Schulz, A. W. Cook, W.H. Cabot, B.R. de Supinski and W.D. Krauss, “On the Performance of the Miranda CFD Code on Multicore Architectures”, Twenty First International Conference on Parallel Computational Fluid Dynamics (ParallelCFD 2009), (LLNL-ABS-411404).
  15. M. Schulz, J. Galarowicz, D. Maghrak, W. Hachfeld, D. Montoya, S. Cranford, “Analyzing the Performance of Scientific Applications with Open|SpeedShop”,Twenty First International Conference on Parallel Computational Fluid Dynamics (ParallelCFD 2009), (LLNL-ABS- 418135).
  16. J. Li, M. Xiaosong, K. Singh, M. Schulz, B.R. de Supinski, and S.A. McKee, "Machine Learning Based Online Performance Prediction for Runtime Parallelization and Task Scheduling," 2009 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Boston, Massachusetts, April 26–28, 2009. (LLNL-CONF-407723)
  17. G. Lee, D.H. Ahn, D.C. Arnold, B.R. de Supinski, M. Legendre, B.P. Miller, M. Schulz and B. Liblit, "Lessons Learned at 208K: Towards Debugging Millions of Cores," SC2008, Austin, Texas, November 15–21, 2008. (LLNL-CONF-402967).
  18. T. Gamblin, B.R. de Supinski, M. Schulz, R.J. Fowler and D. A. Reed, "Scalable Load Balance Measurement for SPMD Codes," SC2008, Austin, Texas, November 15–21, 2008. (LLNL-CONF-406045).
  19. M. Curtis-Maury, A. Shah, F. Blagojevic, D.S. Nikolopoulos, B.R. de Supinski and M. Schulz, "Prediction Models for Multi-dimensional Power-Performance Optimizations on Many Cores," Seventeenth International Conference on Parallel Architectures and Compilation Techniques (PACT-2008), Toronto, ON, Canada, October 25–29, 2008. (LLNL-CONF-400453).
  20. D. Ahn, D.C. Arnold, B.R. de Supinski, G.L. Lee, B.P. Miller and M. Schulz, “Overcoming Scalability Challenges for Tool Daemon Launching,” 2008 International Conference on Parallel Processing (ICPP-08), Portland, OR, September 8-12, 2008. (LLNL-CONF-401480).
  21. R. Preissl, T. Koeckerbauer, M. Schulz, D. Kranzlmueller, B.R. de Supinski and D.J. Quinlan, “Detecting Patterns in MPI Communication Traces,” 2008 International Conference on Parallel Processing (ICPP-08), Portland, OR, September 8-12, 2008. (LLNL-CONF-401716).
  22. M. Schulz, G. Bronevetsky and B.R. de Supinski, “On the Performance of Transparent MPI Piggyback Messages,” EuroPVM/MPI 2008, Dublin, Ireland, September 7–10, 2008. (LLNL-CONF-402937).
  23. R. Preissl, M. Schulz, D. Kranzlmueller, B.R. de Supinski and D.J. Quinlan, “Using MPI Communication Patterns to Guide Source Code Transformations,” Tools for Program Development and Analysis in Computational Science, Krakow, Poland, June 23-25, 2008. (LLNL-CONF-400356).
  24. B. Barnes, B. Rountree, D.K. Lowenthal, J. Reeves, B.R. de Supinski and M. Schulz, "A Regression-Based Approach to Scalability Prediction," Twenty Second International Conference on Supercomputing (ICS 2008), Kos, Greece, June 7-12, 2008. (LLNL-CONF-400700).
  25. P. Ratn, F. Mueller, B.R. de Supinski and M. Schulz, "Preserving Time in Large-Scale Communication Traces," Twenty Second International Conference on Supercomputing (ICS 2008), Kos, Greece, June 7-12, 2008. (LLNL-CONF-400703).
  26. B. de Supinski, R.J. Fowler, T. Gamblin, F. Mueller, P. Ratn and M. Schulz, "An Open Infrastructure for Scalable, Reconfigurable Analysis," International Workshop on Scalable Tools for High-End Computing (STHEC), Kos, Greece, June 7, 2008. (LLNL-CONF-403954).
  27. M. Schulz and B.R. de Supinski, “P^nMPI Tools: A Whole Lot Greater than the Sum of Their Parts”, In Supercomputing 2007, Reno, NV, USA, November 12-18, 2007. (UCRL-CONF-229978).
  28. B. Rountree, D. Lowenthal, S. Funk, V.W. Freeh, B.R. de Supinski and M. Schulz, “Bounding Energy Consumption in Large-scale MPI Programs”, In Supercomputing 2007, Reno, NV, USA, November 12-18, 2007. (UCRL-CONF-233221).
  29. G. Lee, D. Ahn, B.R. de Supinski, M. Schulz, D.C. Arnold, and B.P. Miller, Benchmarking the Stack Trace Analysis Tool for BlueGene/L, In Parallel Computing: Architectures, Algorithms and Applications Proceedings of the International Conference ParCo 2007, Aachen, Germany, September 4-7, 2007. (UCRL-CONF-235241).
  30. M. Schulz and B.R. de Supinski, “Practical Differential Profiling,” Euro-Par 2007, Rennes, France, August 28 – 31, 2007. (UCRL-CONF-227812*).
  31. D. Arnold, D.H. Ahn, B.R. de Supinski, G.L. Lee, B.P. Miller and M. Schulz, "Stack Trace Analysis for Large Scale Debugging," Twenty First International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, March 26–30, 2007. UCRL-CONF-227108.
  32. M. Noeth, F. Mueller, M. Schulz and B.R. de Supinski, "Scalable Compression and Replay of Communication Traces in Massively Parallel Environments," Twenty First International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, March 26–30, 2007, UCRL-CONF-227098. (Best Paper Award)
  33. B. Lee, D.M. Brooks, B.R. de Supinski, M. Schulz, K. Singh and S.A. McKee, “Methods of Inference and Learning for Performance Modeling of Parallel Applications,” ACM SIGPLAN 2007 Symposium on Principles and Practice of Parallel Programming (PPoPP 2007), San Jose, CA, March 14-17, 2007. (UCRL-CONF-227097).
  34. F. Gygi, E.W. Draeger, M. Schulz, B.R. de Supinski, J.A. Gunnels, V. Austel, J.C. Sexton, F. Franchetti, S. Kral, C.W. Überhuber and J. Lorenz, "Large-Scale Electronic Structure Calculations of High-Z Metals on the BlueGene/L Platform," SC2006, Tampa, FL, November 11-17, 2006, UCRL-PROC-220592. (Gordon Bell Prize Winner)
  35. E. Ipek, S.A. McKee, B.R. de Supinski, M.Schulz and R.Caruana, “Efficiently Exploring Architectural Design Spaces via Predictive Modeling,” Twelfth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XII), San Jose, CA, October 21-25, 2006, UCRL-CONF-223240.
  36. M. Schulz, D. Kranzlmueller and B.R. de Supinski, “Exploring Unexpected Behavior in MPI,” 2006 International Conference on High Performance Computing and Communications (HPCC-06), Munich, Germany, September 13-15, 2006, UCRL-CONF-222368.
  37. M. Schulz, and B.R. de Supinski, “A Flexible and Dynamic Infrastructure for MPI Tool Interoperability,” 2006 International Conference on Parallel Processing (ICPP-06), Columbus, OH, August 14-18, 2006, UCRL-CONF-221608.
  38. M. Noeth, F. Mueller, M. Schulz and B.R. de Supinski, "Scalable Compression and Replay of Communication Traces in Massively Parallel Environments", P=ac2 Conference, IBM T.J. Watson, Oct 2006.
  39. M. Schulz, "Extracting Critical Path Graphs from MPI Applications", IEEE Cluster 2005, September 2005, UCRL-CONF-214107.
  40. E. Ipek, B. R. de Supinski, M. Schulz, S.A. McKee, and R. Caruana, "An Approach to Performance Prediction for Parallel Applications", Euro Par 2005, Springer LNCS, 3648, August 2005, UCRL-CONF-212365.
  41. B. S. White, S. A. McKee; B. R. de Supinski, B. Miller, D. Quinlan, and M. Schulz, "Improving the Computational Intensity of Unstructured Mesh Applications", The 19th ACM International Conference on Supercomputing, June 2005, UCRL-CONF-212479.
  42. M. Schulz, B. S. White, S.A. McKee, H.-H. Lee, and J. Jeitner, “Owl: Next Generation System Monitoring”, Proceedings of ACM Computing Frontiers, April 2005, UCRL-CONF-209855.
  43. M. Schulz, G. Bronevetsky, R. Fernandes, D. Marques, K. Pingali, and P. Stodghill, "Implementation and Evaluation of a Scalable Application-level Checkpoint-Recovery Scheme for MPI Programs", Proceedings of Supercomputing 2004, November 2004, UCRL-CONF-205612.
  44. G. Bronevetsky, M. Schulz, P. Szwed., D. Marques, and K. Pingali, "Application-level Checkpointing for Shared Memory Programs", Proceedings of the Eleventh International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2004), October 2004, UCRL-CONF-205594.
  45. T. Mohan, B.R. de Supinski, S.A. McKee, F. Mueller, A. Yoo, and M. Schulz, “Identifying and Exploiting Spatial Regularity in Data Memory References”, Supercomputing 2003, November 2003.
  46. E. Wheelhouse, C. Trinitis, M. Schulz, and A. Blaszczyk, “CAD Grid: Corporate-Wide Resource Sharing for Parameter Studies”, Euro-Par 2003, European Conference on Parallel Computing, August 2003 (LNCS, Springer Verlag ).
  47. T. Mu, J. Tao, M. Schulz, and S.A. McKee, “Interactive Locality Optimizations on NUMA Architectures”, ACM Symposium on Software Visualization (Softvis), June 2003 (ACM Press).
  48. M. Schulz and S.A. McKee, “A Framework for Portable Shared Memory Programming”, International Parallel and Distributed Processing Symposium (IPDPS), April 2003 (IEEE CS Press).
  49. J. Tao, M. Schulz, and W. Karl, “A Simulation Tool for Evaluating Shared Memory Systems”, Annual Simulation Symposium (ASS), May 2003.
  50. D. Kranzlmüller and M. Schulz, “Notes on Nondeterminism in Message Passing Programs”, 9th European PVM/MPI Users´ Group Meeting, pp. 357-367, October 2002 (LNCS, Springer Verlag).
  51. C. Trinitis, M. Schulz, and W. Karl, “A Comprehensive Electric Field Simulation Environment on Top of SCI”, 9th European PVM/MPI Users´ Group Meeting, pp. 114-121, October. 2002 (LNCS, Springer Verlag).
  52. C. Trinitis, M. Schulz, and W. Karl, “Boosting the Performance of Electromagnetic Simulations on a PC-Cluster”, International Conference of Parallel Computing in Electrical Engineering (PARELEC), September 2002 (IEEE CS Press).
  53. M. Schulz, “Using Semantic Information to Guide Efficient Parallel I/O on Clusters”, Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing (HPDC-11), July 2002 (IEEE CS Press).
  54. M. Schulz, J. Tao, C. Trinitis, and W. Karl, “SMiLE: An Integrated, Multi-Paradigm Software Infrastructure for SCI-based Clusters”, Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid), (selected for best papers publication in FGCS), May 2002 (IEEE CS Press).
  55. J. Tao, M. Schulz, and W. Karl, “Improving Data Locality Using Dynamic Page Migration based on Memory Access Histograms”, Proceedings of the International Conference on Computational Science (ICCS), session on Tools for Program Development and Analysis, April 2002 (LNCS, Springer Verlag).
  56. M. Schulz, “Parallel Volume Rendering based on Isosurface Extraction Using Commodity Clusters”, Visualization, Imaging, and Image Processing (VIIP), September 2001 (ACTA Press).
  57. J. Tao, W. Karl, and M. Schulz, “Using Simulation to Understand the Data Layout of Programs”, Applied Simulation and Modeling (ASM), September 2001 (ACTA Press).
  58. C. Trinitis, M. Schulz, M. Eberl, and W. Karl, “SCI-based LINUX PC-Clusters as a Platform for Electromagnetic Field Calculations”, 6th International Conference on Parallel Computing Technologies (PaCT 2001), September 2001 (LNCS, vol. 2127, Springer Verlag).
  59. M. Schulz, “DIOM: Parallel I/O for Data Intensive Applications on Commodity Clusters”, Parallel and Distributed Computing and Systems (PDCS), August 2001 (ACTA Press).
  60. W. Karl, M. Schulz, M. Völk, and S. Ziegler, “Meeting the Computational Demands of Nuclear Medical Imaging using Commodity Clusters”, International Conference on Computational Science (ICCS), May 2001 (LNCS, vol. 2074, Springer Verlag).
  61. J. Tao, W. Karl, and M. Schulz, “Visualizing the Memory Access Behavior of Shared Memory Applications on NUMA Architectures”, International Conference on Computational Science (ICCS), May 2001 (LNCS, vol. 2074, Springer Verlag).
  62. M. Schulz, “Efficient Deployment of shared memory models on clusters of PCs using the SMiLEing HAMSTER approach”, International Conference on Algorithms and Architectures in Parallel Processing (ICA3PP), December 2000, (World Scientific Publishing).
  63. M. Schulz, “Multithreaded Programming of PC clusters”, Parallel Architectures and Compilation Techniques (PACT), October 2000 (IEEE Press).
  64. W. Karl, M. Schulz, M. Völk, and S. Ziegler, “NEPHEW: Applying a Toolset for the Efficient Deployment of a Medical Image Application on SCI-based clusters”, Euro-Par 2000, European Conference on Parallel Computing, August 2000 (LNCS, vol. 1900, Springer Verlag).
  65. M. Schulz, “Efficient Coherency and Synchronization Management in SCI based DSM systems”, SCI-Europe 2000, August 2000.
  66. J. Tao, W. Karl, and M. Schulz, “Understanding the Behavior of Shared Memory Applications Using the SMiLE Monitoring Framework”, SCI-Europe 2000, August 2000.
  67. R. Hockauf, J. Jeitner, W. Karl, R. Lindhof, M. Schulz, V. Gonzales, E. Sanquis, and G. Torralba, “Design and Implementation Aspects for the SMiLE Hardware Monitor”, SCI-Europe 2000, August 2000.
  68. W. Karl, M. Schulz, and J. Tao, “Using the SMiLE Monitoring Infrastructure to Detect and Lower the Inefficiency of Parallel Applications”, HPCN-Europe, May 2000 (LNCS, vol. 1823, Springer Verlag).
  69. W. Karl, M. Schulz, and J. Trinitis, “Multilayer Online-Monitoring for Hybrid DSM systems on top of PC clusters with a SMiLE”, 11th International Conference on Modeling Techniques and Tools for Computer Performance Evaluation, USA, March 2000 (LNCS, vol. 1786, Springer Verlag).
  70. W. Karl, M. Leberecht, and M. Schulz, “Optimizing data locality for SCI-based PC-clusters with the SMiLE monitoring approach”, Parallel Architectures and Compilation Techniques (PACT), October 1999 (IEEE Press).
  71. M. Schulz, M. Völk, W. Karl, F. Munz, and S. Ziegler, “Running a spectral analysis code on top of SCI shared memory using the TreadMarks API”, SCI-Europe '99, September 1999.
  72. M. Schulz and H. Hellwagner, “Global Virtual Memory based on SCI-DSM”, SCI-Europe '98, September 1998.
  73. M. Schulz, SISCI Pthreads, “SMP-like programming on an SCI-cluster”, HPCN-Europe, April 1998 (LNCS, vol. 1401, Springer Verlag).
  74. X. Zhang, A. Dasdan, M. Schulz, R. Gupta, and A. Chien, “Architectural Adaptation of Application-Specific Locality Optimizations”, International Conference on Computer Design (ICCD), September 1997.

Workshops

  1. A. Humphrey, C. Derrick, B. Tibbitts, A. Vo, S. Vakkalanka, G. Gopalakrishnan, B. de Supinski, M. Schulz and G. Bronevetsky, “Verification for Portability, Scalability, and Grokkability”, (EC)2 2010: Workshop on Exploiting Concurrency Efficiently and Correctly, Edinburgh, UK, July 2010.
  2. G. Bronevetsky, Ignacio Laguna, Saurabh Bagchi, Bronis R. de Supinski, Dong H. Ahn and Martin Schulz, “Statistical Fault Detection for Parallel Applications with AutomaDeD,” The 2010 IEEE Workshop on Silicon Errors in Logic - System Effects (SELSE 6), Palo Alto, CA, March 23-24, 2010. (LLNL-CONF-426254*).
  3. F. Mueller, X. Wu, M. Schulz, B.R. de Supinski, and T. Gamblin, “ScalaTrace: Tracing, Analysis and Modeling of HPC Codes at Scale”, Para 2010: State of the Art in Scientific and Parallel Computing, Reykjavi, June 2010.
  4. T. Hilbrich, M. Schulz, B.R. de Supinski. M.S. Mueller, “MUST: A Scalable Approach to Runtime Error Detection in MPI Programs”, 3rd Parallel Tools Workshop, Dresden, Germany, September 2009.
  5. S. Biswas, D. Franklin, T. Sherwood, F. Chang, B.R. de Supinski, M. Schulz, “PSMalloc: Content Based Memory Management for MPI Applications”, MEDEA 2009, Raleigh, NC, September 2009 (LLNL-PROC-414508).
  6. B. de Supinski, R.J. Fowler, T. Gamblin, F. Mueller, P. Ratn and M. Schulz, "An Open Infrastructure for Scalable, Reconfigurable Analysis," International Workshop on Scalable Tools for High-End Computing (STHEC), Kos, Greece, June 7, 2008. (LLNL-CONF-403954).
  7. R. Preissl, M. Schulz, D. Kranzlmueller, B.R. de Supinski and D.J. Quinlan, “Using MPI Communication Patterns to Guide Source Code Transformations”, Tools for Program Development and Analysis in Computational Science, Springer LNCS, 5103, May 2008, UCRL-CONF-400356.
  8. M. Curtis-Maury, K. Singh, S.A. McKee, F. Blagojevic, D.S. Nikolopoulos, B.R. de Supinski, M. Schulz. "Identifying Energy-Efficient Concurrency Levels using Machine Learning". In Proceedings of the International Workshop on Green Computing, Austin, TX, September 2007, UCRL-CONF-233024.
  9. R. Vuduc, M. Schulz, D. Quinlan, B. R. de Supinski, and A. Saebjornsen, “Improving Distributed Memory Applications Testing by Message Perturbation”, Fourth Workshop on Parallel and Distributed Systems: Testing and Debugging (PADTAD - IV), July 17, 2006, UCRL-PROC-221395. (Best Paper Award)
  10. E. Ipek, J. Martinez, B. de Supinski, S. McKee, M. Schulz, “Dynamic Program Phase Detection in Distributed Shared-Memory Multiprocessors”, NSF Next Generation Software Program Workshop (an IPDPS 2006 Workshop), April 25, 2006, UCRL-CONF-219596.
  11. E. Ipek, M. Schulz, B. R. de Supinski, S. A. McKee, and R. Caruana. "Automatic Model Generation for Performance Prediction", Dagstuhl Workshop on Automatic Performance Analysis, Dagstuhl, Germany, December 2005, UCRL-ABS-217735.
  12. M. Schulz, D. Ahn, A. Bernat, B. R. de Supinski, S. Y. Ko, and B. Rountree, "Scalable Dynamic Instrumentation for BlueGene/L", Workshop on Binary Instrumentation and Applications (St. Louis, MO United States, September 2005), to be published ACM SIGARCH News, UCRL-CONF-215232.
  13. M. Schulz, J. May, and J. Gyllenhaal, "DynTG: A tool for Interactive, Dynamic Instrumentation", Tools for Program Development and Analysis in Computational Science, Springer LNCS, 3515, pp 140 14, May2005, UCRL-CONF-209840.
  14. T. Suh, H.-H. S. Lee, S. A. McKee, and M. Schulz. "Evaluating System-wide Monitoring Capsule Design Using Xilinx Virtex-II Pro FPGA." In Workshop on Architecture Research using FPGA Platforms in conjunction with International Symposium on High-Performance Computer Architecture, San Francisco, CA, February, 2005.
  15. M. Schulz, B. S. White, S. A. McKee, H.H.-S. Lee, and J. Jeitner, "A Vision for Next Generation System Monitoring", HPCA Workshop on Hardware Monitoring, February 2005, UCRL-ABS-208943.
  16. G. Bronevetsky G., M. Schulz, P. Szwed., D. Marques, and K. Pingali, "Checkpointing for Shared Memory Programs at the Application-level", Proceedings of the Sixth European Workshop on OpenMP, Stockholm, Sweden, Oct. 18-22, 2004, UCRL-CONF-206542.
  17. J. Tao, M. Schulz, W. Karl, “SIMT/OMP: A Toolset to Study and Exploit Memory Locality of OpenMP Applications on NUMA Architectures”, Workshop on OpenMP Applications and Tool (WOMPAT), May, 2004.
  18. P. Szwed, D. Marques, Robert M. Buels, S.A. McKee, M. Schulz, “SimCheck: Fast-Forwarding via Native Execution and Application-Level Checkpointing”, Proceedings of the 8th Workshop on Interaction Between Compilers and Computer Architecture (INTERACT 8), February 2004.
  19. T. Brandes, H. Schawmborn, M. Gerndt, J. Jeitner, E. Kereku, W. Karl, M. Schulz, J. Tao, H. Brunst, W. Nagel, R. Neumann, R. Mueller-Pfefferkorn, B. Trenkler, H.-C. Hoppe, „Werkzeuge für die effiziente parallele Programmierung von Cache-Architekturen”, Proceedings of the 19th PARS Workshop, March 2003.
  20. M. Schulz, J. Tao, and S.A. McKee, “Local Relaxed Consistency Schemes on Shared-Memory Clusters”, Proceedings of the 2nd Workshop on System Area Networks (SAN-2), held at HPCA-9, February 2003.
  21. M. Schulz and C. Trinitis, “An Integrated Parallel Simulation Environment for Electrostatic and Electromagnetic Field Distributions in High Voltage Components”, 6th Meeting of the IBM Scientific User Group, August 2002.
  22. M. Schulz, J. Tao, and W. Karl, “Improving the Scalability of Shared Memory Systems through Relaxed Consistency”, Second Workshop on Caching, Coherence and Consistency (WC3 '02) / held together with ICS'02, June 2002.
  23. M. Schulz, J. Tao, J. Jeitner, and W. Karl, “A Proposal for a New Hardware Cache Monitoring Architecture”, ACM SIGPLAN workshop on Memory Systems Performance (MSP), held together with PLDI 2002, June 2002.
  24. M. Schulz, “Overcoming the Problems Associated with the Existence of Too Many DSM APIs”, Proceedings of the 2002 International Workshop on Distributed Shared Memory on Clusters, held together with CCGrid02, May 2002 (IEEE CS Press).
  25. J. Tao, W. Karl, and M. Schulz, “A Novel Approach for Data Distribution on NUMA Machines”, Proceedings of the 6th German workshop on Parallel Systems and Algorithms, April 2002.
  26. M. Gerndt, A. Schmidt, M. Schulz, and R. Wismüller, “Perfomance Analysis of Teraflop Computers – A Distributed Automatic Approach”, 10th Euromicro Workshop on Parallel, Distributed, and Network Processing (PDP), January 2002.
  27. W. Karl and M. Schulz, “Hybrid-DSM: An Efficient Alternative to Pure Software DSM Systems on NUMA Architectures”, 2nd International Workshop on Software DSM, held together with ICS 2000, May 2000.
  28. M. Schulz, “SCI-VM: A flexible base for transparent shared memory programming models on clusters of PCs”, High level Programming Models and Supportive Environments (HIPS '99), held together with IPDPS 1999, April 1999 (LNCS, vol. 1586, Springer Verlag).
  29. M. Eberl, W. Karl, M. Leberecht, and M. Schulz, „Eine Software-Infrastruktur für Nachrichtenaustausch und gemeinsamen Speicher auf SCI-basierten PC-Clustern“, 2. Workshop on Cluster Computing, March 1999.
  30. W. Karl, M. Leberecht, and M. Schulz, “Supporting Shared Memory and Message Passing on Clusters of PCs with a SMiLE”, 3rd International Workshop on Communication, Architecture and Applications for Network-Based Parallel Computing (CANPC '99), held together with HPCA, January 1999 (LNCS, vol. 1602, Springer Verlag).
  31. M. Eberl, H. Hellwagner, B. Herland, and M. Schulz, “SISCI - Implementing a Standard Software Infrastructure on an SCI Cluster”, 1. Workshop on Cluster Computing, November 1997.

Posters

  1. D. Boehme, Martin Schulz, Bronis R. de Supinski, Markus Geimer and Felix Wolf, "Critical Path Analysis for Large-Scale MPI Programs," a poster at SC2010, New Orleans, Louisiana, November 13–19, 2010. (LLNL-POST-447564*).
  2. O. Pearce, Todd Gamblin, Martin Schulz, Bronis R. de Supinski and Nancy Amato, "Load Balance: Correlating Application-Independent Measurements with Application-Semantic Computational Models," a poster at SC2010, New Orleans, Louisiana, November 13–19, 2010. (LLNL-POST-432915*).
  3. C, Klausecker, Thomas Koeckerbauer, Martin Schulz, and Dieter Kranzlmueller, “A New Generation of Integrated Debugging Tools in Eclipse”, a poster at SC2010, New Orleans, Louisiana, November 13–19, 2010.
  4. D. Li, Bronis R. de Supinski, Martin Schulz, Kirk W. Cameron and Dimitrios S. Nikolopoulos, "Model-Based Hybrid MPI/OpenMP Power-Aware Computing," a poster at SC2009, Portland, Oregon, November 14–20, 2009. (LLNL-POST-423694*).
  5. D. Li, K.W. Cameron, D.S. Nikolopoulos, M. Schulz, and B.R. de Supinski, “Model-Based Hybrid MPI/OpenMP Power-Aware Computing”, Supercomputing 2008, November 2008.
  6. T. Gamblin, B.R. de Supinski, M. Schulz, D. Reed, and R. Fowler, “Scalable Performance Equivalence Class Detection Using Clustering”, Supercomputing 2008, November 2008.
  7. B. Rountree, D. Lowenthal, B.R. de Supinksi, M. Schulz, V. Freeh, T. Bletch, “Adagio: Saving Energy with Runtime Dynamic Voltage Scaling”, Supercomputing 2008, November 2008.
  8. T. Gamblin, P. Ratn, B.R. de Supinski, M. Schulz, F. Mueller, R.J. Fowler, D.A. Reed, “An Open Framework for Scalable, Reconfigurable Performance Analysis”, Supercomputing 2007, November 12-18, 2007, UCRL-POST-236200.
  9. R. Preissl, M. Schulz, D. Kranzlmueller, B.R. de Supinski, D.J. Quinlan, “Using MPI Communication Patterns To Guide Source Code Transformations”, Supercomputing 2007, November 12-18, 2007, UCRL-POST- 236042
  10. M. Noeth, F. Mueller, M. Schulz, and B. de Supinski, "Scalable Compression and Replay of Communication Traces in Massively Parallel Environments", Supercomputing 2006, November 11-17, 2006, UCRL-POST-225759.
  11. B. Aichinger, M. Schulz, D. Kranzmueller, R. Preissl and T. Koeckerbauer, B. de Supinski, "Patterns in Parallel Programs - Towards High-level Understanding of Large-Scale Traces", Supercomputing 2006, November 11-17, 2006, UCRL-POST-225763.
  12. M. Schulz, D. Kranzlmüller, and B. R. de Supinski, "The MPI Test Suite Unexpected Behavior in a Standardized Programming Environment", Supercomputing 2005, November 2005.
  13. T. Mu, J. Tao, M. Schulz, and S.A. McKee, “Interactive Locality Optimizations on NUMA Architectures”, ACM Symposium on Software Visualization (Softvis), June 2003.
  14. T. Mu, J. Tao, M. Schulz, and S.A. McKee, “Visualizing Data Distributions on NUMA Architectures to Guide Incremental Optimizations”, Supercomputing 2002, November 2002.
  15. M. Schulz, C. Trinitis, J. Tao, W. Karl, “SMiLE: An integrated, multiparadigm infrastructure for High Performance Computing on SCI-based Clusters”, Supercomputing 2001, November 2001.
  16. G. Torralba, V. Gonzáles, E. Sanchis, J. Tao, M. Schulz, and W. Karl, “Data Monitoring in High Performance Clusters”, 12th IEEE International Congress on Real Time for Nuclear and Plasma Sciences, NPSS, June 2001.
  17. M. Schulz, M. Voelk, W. Karl, and S. Ziegler, „Effiziente iterative PET-Bild Rekonstruktion auf einem Cluster von PCs“, Jahreskongresses der DEGRO, ÖGRO, DGMP - Band 176, Sonder¬nummer 1, October 2000.
  18. G. Acher, R. Buchty, M. Eberl, D. Fliegl, W. Karl, M. Leberecht, M. Schulz, and C. Trinitis, “High-Performance Cluster Computing”, International trade fair CeBIT '99, March 1999.
  19. M. Schulz and H. Hellwagner, “Extending NT Virtual Memory by SCI-based Hardware DSM”, Usenix Windows NT Symposium, August 1998.

Tutorials

  1. Martin Schulz, Jim Galarowicz, Don Maghrak, David Montoya, and Mahesh Rajan, “How to Analyse the Performance of Parallel Codes 101 – A Case Study with Open|SpeedShop”, SC 2010, New Orleans, LA, November 2010.
  2. Martin Schulz, Don Maghrak, “How to Analyse the Performance of Parallel Codes 101 – A Case Study with Open|SpeedShop”, SciDAC 2010, Chattanooga, TN, July 2010.
  3. Martin Schulz, Don Maghrak, David Montoya, “Performance Analysis and Optimization with Open|SpeedShop”, LCI Conference 2010, Pittsburgh, PA, USA, March 2010.
  4. Adreas Knuepfer, Dieter Kranzlmueller, Martin Schulz, Christof Klausecker, “Large Scale Communication Analysis: Tools for Understanding Highly Scalable Codes”, Supercomputing 2009, Portland, OR, USA, November 2009.
  5. Martin Schulz, Jim Galarowicz, “Performance Analysis and Optimization with Open|SpeedShop”, IEEE Cluster 2009, New Orleans, LA, USA, August 2009.
  6. Martin Schulz, Jim Galarowicz, Don Maghrak, David Montoya, Scott Cranford, “Parallel Performance Analysis with Open|SpeedShop”, Supercomputing 2008, Austin, TX, USA, November 2008.
  7. Martin Schulz, Jim Galarowicz, Samuel Gutierrez, Scott Cranford, “Parallel Performance Analysis with Open|SpeedShop”, DoD HPCMod Users’ Meeting, Seattle, WA, USA, July 2008.
  8. Martin Schulz, David Montoya, Jim Galarowicz, “Open|SpeedShop: An Open Source Performance Analysis Framework for Cluster Platforms”, High Performance Computer Science Week (HPCSW), Denver, CO, USA, April 2008.
  9. David Brooks, Bronis R. de Supinski, Benjamin Lee, Sally A. McKee, Martin Schulz, Karan Singh, “Methods of Learning and Inference for Large Design and Parameter Spaces”, International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Seattle, WA, USA, March 2008.
  10. Martin Schulz, Jim Galarowicz, David Montoya, “Open|SpeedShop: Open Source Performance Analysis for Linux Clusters”, International Conference on Supercomputing (ICS) 2007, June, 2007.
  11. David Brooks, Bronis R. de Supinski, Benjamin C. Lee, Sally A. McKee, Martin Schulz and Karan Singh, “Inference and Learning for Large Scale Microarchitectural Analysis,” The 34th International Symposium on Computer Architecture (ISCA 2007), San Diego, CA, June 9-13, 2007. (UCRL-ABS-229864*).
  12. Martin Schulz, Scott Cranford, Nathan De Bardeleben, Jim Galarowicz, Don Maghrak, “Open|SpeedShop: Open Source Performance Analysis for Linux Clusters”, Supercomputing 2006, November 11-17, 2006.
  13. Martin Schulz and Joachim Worringen, “Tutorial: SCI Low-level Programming: SISCI / SMI”, Held as part of the SCI Summer School 2001, Trinity College Dublin, Ireland, October 2001.
  14. Wolfgang Karl and Martin Schulz, “Tutorial: SMiLE Shared Memory Programming”, Held as part of the SCI Summer School 2000, Trinity College Dublin, Ireland, October 2000.

Book Chapters

  1. M. Schulz, “Checkpointing”, Encyclopedia of Parallel Computing, D. Padua (ed), Springer Verlg, (LLNL-MI-419844)
  2. B. de Supinski, Martin Schulz and Erik W. Draeger, Flexible Tools Supporting a Scalable First-Principles MD Code, a chapter in Scientific Computer Performance, Daivd H. Bailey, Robert F. Lucas and Samuel Williams, editors, Taylor and Francis, publishers, New York, NY, 2010. (UCRL-JRNL-445511*).
  3. M. Gerndt, A. Schmidt, M. Schulz, and R. Wismüller, “Automatic Performance Analysis on Hitachi SR8000”, in High Performance Computing in Science and Engineering, Munich 2002, S. Wagner, W. Hanke, A. Bode, and F. Durst (eds.), Springer Verlag, January 2003.
  4. M. Schulz, “True shared memory programming on SCI-based clusters”, in Scalable Coherent Interface / SCI, Architecture and Software for High-Performance Compute Clusters, H. Hellwagner and R. Reinefeld (eds.), LNCS State-of-the-Art Survey, vol. 1734, Springer Verlag, October 1999.

Editor

  1. M. Schulz and S. Midkiff, “Proceedings of the 13th International Workshop on High-level Parallel Programming Models and Supportive Environments”, published as part of the IPDPS workshop proceedings, April 2008.
  2. M. Gerndt and M. Schulz, “Proceedings of the 8th International Workshop on High-level Parallel Programming Models and Supportive Environments”, IEEE Computer Society Press, April 2003.
  3. B. Coghlan, G. Horn, and M. Schulz, “Proceedings of the 4th International Conference on SCI-based Research and Technology”, SINTEF, October 2001.
  4. J. Rolim, M. Schulz, et. al., “Parallel and Distributed Processing (15 IPDPS 2000 Workshops)”, Lecture Notes in Computer Science (LNCS) vol. 1800, Springer Verlag, May 2000.

Thesis

  1. M. Schulz, “Shared Memory Programming on NUMA-based Clusters using a General and Open Hybrid Hardware/Software Approach”, PhD thesis, Technische Universität München, July 2001.
  2. M. Schulz, “Application Study for the Illinois Concert C++, A Parallel Volume Renderer”, Master's thesis, University of Illinois at Urbana-Champaign, January 1997.

Other Publications

  1. C. Trinitis, M. Bader and M. Schulz, Proceedings of ParSim´09 – “Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments”, Held with the 16th European PVM/MPI User's Group Meeting, September 2009 (LNCS, Springer Verlag).
  2. C. Trinitis and M. Schulz, Proceedings of ParSim´08 – “Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments”, Held with the 15th European PVM/MPI User's Group Meeting, September 2008 (LNCS, Springer Verlag).
  3. C. Trinitis and M. Schulz, Proceedings of ParSim´07 – “Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments”, Held with the 14th European PVM/MPI User's Group Meeting, September 2007 (LNCS, Springer Verlag).
  4. B. Lee, M. Schulz, and B. de Supinski, “Regression Strategies for Parameter Space Exploration: A Case Study in Semicoarsening Multigrid and R”, Lawrence Livermore National Laboratory, September 29, 2006, UCRL-TR-224851.
  5. C. Trinitis and M. Schulz, Proceedings of ParSim´06 – “Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments”, Held with the 13th European PVM/MPI User's Group Meeting, September 2006 (LNCS, Springer Verlag).
  6. C. Trinitis and M. Schulz, Proceedings of ParSim´05 – “Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments”, Held with the 12th European PVM/MPI User's Group Meeting, September 2005 (LNCS, Springer Verlag).
  7. C. Trinitis and M. Schulz, Proceedings of ParSim´04 – “Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments”, Held with the 11th European PVM/MPI User's Group Meeting, September 2004 (LNCS, Springer Verlag).
  8. C. Trinitis and M. Schulz, Proceedings of ParSim´03 – “Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments”, Held with the 10th European PVM/MPI User's Group Meeting, September 2003 (LNCS, Springer Verlag).
  9. T. Mohan, B. de Supinski, S. McKee, F. Mueller, A. Yoo, M. Schulz, "Identifying and Exploiting Spatial Regularity in Data Memory References", Lawrence Livermore National Laboratory, July 2003, TR UCRL-JC-154597.
  10. C. Trinitis and M. Schulz, Proceedings of ParSim´02 – “Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments”, Held with the 9th European PVM/MPI User's Group Meeting, October 2002 (LNCS Volume 2474, Springer Verlag).
  11. M. Schulz, K. Inoue, B. Childers, and S.A. McKee, Guest Editors, ACM Computer Architecture News, Summer 2002 (Proceedings of HPCA-2002 WiP Session).
  12. M. Schulz, B. Childers, and S.A. McKee, Guest Editors, IEEE Technical Committee on Computer Architecture (TCCA) Newsletter, Fall 2001 (Proceedings of PACT’01 WiP Session).

Invited Presentations

  1. M. Schulz, How can Tools keep up with the Growing Size of HPC Systems?, Invited Presentation at the Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany, May 2010.
  2. M. Schulz, “How can Tools Keep up with the Growing Size of HPC Systems?”, Invited Presentation at the Leibniz Compute Center, Munich, January 2010
  3. M. Schulz, “Performance Modeling Techniques to Characterize and Optimize Scaling”, North Carolina State University, February 2009
  4. M. Schulz, “Keeping up with Growing Machine Sizes: Challenges and Opportunities for Scaling Tools”, Juelich Supercomputing Center, September 2008.
  5. M. Schulz, “Keeping up with Growing Machine Sizes: Challenges and Opportunities for Scaling Tools”, Opening Keynote at EuroPar 2008, Las Palmas de Gran Canaria, August 2008.
  6. M. Schulz, “Scalable Performance Analysis: From Open|SpeedShop to a Community Infrastructure”, Invited Keynote presentation at PROPER 2008, August 2008.
  7. M. Schulz, “Developing New Tool Strategies for Scalable HPC Systems”, North Carolina State University, May 2007.
  8. M. Schulz, "Leading the Way to Ultrascale Computing, The BlueGene/L System Software Environment", University of Linz, Austria, Department of Computer Science, December 2005.
  9. M. Schulz, "Owl: Reconfigurable, Systemwide Monitoring", HP, Palo Alto, March 2005.
  10. M. Schulz, “Reconfigurable System-wide Monitoring - Laying the Foundations for Autonomous Systems”. Northwestern University, February 2004.
  11. M. Schulz, “Reconfigurable System-wide Monitoring - Laying the Foundations for Autonomous Systems”. University of Connecticut, February 2004.
  12. M. Schulz, “Reconfigurable System-wide Monitoring - Laying the Foundations for Autonomous Systems”. University of Pittsburgh, February 2004.
  13. M. Schulz, “Owl: Rekonfigurierbares, Systemweites Monitoring – Grundstein fuer autonome Systeme”, Universität Karlsruhe, Germany, February 2004.
  14. M. Schulz, “Adaptive Systems - Foundations and Opportunities”. Lawrence Livermore National Laboratory, January 2004.
  15. M. Schulz, “HAMSTER: A framework for portable shared memory programming”, AT&T Research, NJ, USA, November 2002.
  16. M. Schulz, “Shared Memory Programming on SCI-based Clusters”, Trinity College Dublin, Ireland (as part of a lecture series for final year students), February 2002.
  17. M. Schulz, “HAMSTER: A Framework for Shared Memory Support in NUMA-based Cluster Environments”, Illinois Institute of Technology, Chicago, IL, USA, February 2002.
  18. M. Schulz, „Cluster Computing mit SCI: Von der Hardware bis zur Anwendung“, Informatik Kolloquium, Johannes Kepler Universität Linz, Austria, December 2001.
  19. M. Schulz, “Distributed Shared Memory: Shared Memory für Cluster Umgebungen”, 2nd Meeting of the KONWIHR Working Group “Tools for Porting Applications to SMP Clusters”, Technische Universität München, München, Germany, December 2001.
  20. M. Schulz, “HAMSTER: A Framework for Shared Memory Support in NUMA-based Cluster Environments”, Brown University, RI, USA, November 2001.
  21. M. Schulz, „Shared Memory Programmierung im SMiLE Projekt: Das HAMSTER System“, Max-Planck-Institut für neuropsychologische Forschung, Leipzig, Leipzig, Germany, September 2001.
  22. M. Schulz, “DSM Softwarearchitekturen und Programmierumgebungen”, Technische Universität Chemnitz, Germany (as part of the lecture "Cluster and Grid-Computing"), June 2001.
  23. M. Schulz. “Efficient Shared Memory Support in NUMA-based Cluster Environments”, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA, June 2001.
  24. M. Schulz, “Efficient Shared Memory Support in NUMA-based Cluster Environments”, Lawrence Livermore National Laboratory, CASC, Livermore, CA, USA, June 2001.
  25. M. Schulz, “Shared Memory programming on top of SCI, Open SCI users workshop”, Oslo, Norway, August 1999.
  26. M. Schulz, “The SMiLE project (Shared Memory in a Lan-like Environment)”, Open SCI users workshop, Oslo, Norway, August 1999.
  27. M. Schulz, “The SMiLE Project, An overview”, Held at CVUT Prag, Prag, Czech Republic, May 1998.

 

Previous Projects at Cornell, TU-München and UIUC

Coherence Mechanisms for Aliased Memories (since 2002)

Design and evaluation of coherence schemes for systems with memory aliases. Working closely with colleagues at LLNL to provide coherence support for novel high performance memory systems.

Owl: System Monitoring and Performance Evaluation (since 2001)

Design and evaluation of a flexible system-wide monitoring framework. Initial studies focus on monitoring of cache activities.

SimSnap: Combining Native Execution and Architectural Simulation (2003-2005)

Techniques to speedup architectural simulation and to enable the use of realistic workloads.

Application-level Fault Tolerance / Cornell Checkpointing Compiler (2003-2004)

Combined use of compiler technology and run-time mechanism to provide transparent application level fault tolerance for parallel applications. Developed software and participated in the extension of existing techniques for shared memory environments.

Relaxed Hardware Coherence for NUMA architectures (2000-2003)

Studies of the use of relaxed memory consistency schemes in NUMA architectures. This project is inspired by the observation that current architectures often impose overly strict memory coherence schemes and thereby causing unnecessary memory-update traffic. Initiated and managed project.

HAMSTER: Hybrid-dsm based Adaptive and Modular Shared memory archiTEctuRe, (1999-2003)

As part of SMiLE and as a continuation of part of the efforts in SISCI, this project develops a shared memory framework. Using a small set of shared memory services, it can be retargeted to arbitrary shared memory programs independent of the architecture. Initiated and managed project.

DIOM: Distributed I/O Management (2000-2002)

Investigation of efficient parallel I/O for data intensive applications on commodity clusters and implementation of a prototype I/O management framework. Initiated and managed project.

SMiLE: Shared Memory in a Lan-like Environment (1997-2002)

Research project broadly investigating the implementation and exploitation of SCI (Scalable Coherent Interface) based clusters. This includes both the design of hardware and software components. Work within SMiLE includes extensive software development as well as various administrative and strategic tasks.

NEPHEW: Network of PCs Heterogeneous Windows-NT Engineering Toolset (1999-2000)

ESPRIT project funded by the European Union to port a graphical parallel programming package to cluster environments and to evaluate it using three real-world applications, including the iterative reconstruction of Positron Emission Tomography images. Developed software, managed project, and performed extensive dissemination. This project was done in cooperation with four partners in four European countries.

SISCI: Standard software Infrastructure for SCI-based parallel systems (1997-1999)

ESPRIT project funded by the European Union to design, implement, and test a software infrastructure for SCI (Scalable Coherent Interface) based commodity clusters. Tasks included software development, project management, and extensive dissemination. This project was done in cooperation with seven partners in four European countries.

MORPH: Configurable Computing for Petaflops (1996)

Design and evaluation of a next-generation Petaflop architecture using reconfigurable logic. Conducted initial design studies and evaluation using simulation.

Illinois Concert (1995-1996)

High-level and object-oriented parallel programming environment deploying both sophisticated compiler technology and efficient run-time mechanisms. Conducted a large application study (parallel volume rendering using surface extraction) for the Illinois Concert C++ system. LLNL-WEB-400744


Updated: 2010-12-14 08:02:40

 
computation logo
  LLNL-WEB-463959 | Privacy & Legal Notice webmaster-casc@llnl.gov February 16, 2013  
Lawrence Livermore National Laboratory
7000 East Avenue
Livermore, CA 94550
Sponsors
Operated by Lawrence Livermore National Security, LLC
for the Department of Energy's
National Nuclear Security Administration
NNSA Department of Energy LLNL Home LLNL Home LLNL People Search Search LLNL LLNL Home Visiting LLNL CASC Site Map Contact CASC