BACK TO INDEX

RUNTIME

Publications of year 2014


Thesis

  1. Brice Goglin. Vers des mécanismes génériques de communication et une meilleure maîtrise des affinités dans les grappes de calculateurs hiérarchiques. Habilitation à diriger des recherches, University of Bordeaux, 351 cours de la Libération -- 33405 Talence cedex -- France, April 2014. [WWW]
    @PhDThesis{ Gog14HDR,
    author = {Brice Goglin},
    title = {{Vers des m\'ecanismes g\'en\'eriques de communication et une meilleure ma\^itrise des affinit\'es dans les grappes de calculateurs hi\'erarchiques}},
    school = {University of Bordeaux},
    address = {351 cours de la Lib\'eration -- 33405 Talence cedex -- France},
    year = 2014,
    month = APR,
    day = 15,
    type = {Habilitation \`a diriger des recherches},
    OPTnote = {123 pages},
    url = {http://tel.archives-ouvertes.fr/tel-00979512} 
    }
    


  2. Bertrand Putigny. Benchmark-driven Approaches to Performance Modeling of Multi-Core Architectures. PhD thesis, Université Bordeaux, 351 cours de la Libération --- 33405 TALENCE cedex, March 2014. [WWW]
    @PhDThesis{ Put14Thesis,
    author = {Bertrand Putigny},
    title = {{Benchmark-driven Approaches to Performance Modeling of Multi-Core Architectures}},
    school = {{Universit{\'e} Bordeaux}},
    address = {351 cours de la Lib{\'e}ration --- 33405 TALENCE cedex},
    year = 2014,
    month = MAR,
    url = {http://tel.archives-ouvertes.fr/tel-00984791} 
    }
    


Articles in journal or book chapters

  1. Paul-Antoine Arras, Didier Fuin, Emmanuel Jeannot, Arthur Stoutchinin, and Samuel Thibault. List Scheduling in Embedded Systems Under Memory Constraints. International Journal of Parallel Programming, pp 0, November 2014. [WWW] [doi:10.1007/s10766-014-0338-1] Keyword(s): Task graphs, Scheduling, Memory, System on chip, Video decoding.
    @article{arras:hal-01087067,
    TITLE = {{List Scheduling in Embedded Systems Under Memory Constraints}},
    AUTHOR = {Arras, Paul-Antoine and Fuin, Didier and Jeannot, Emmanuel and Stoutchinin, Arthur and Thibault, Samuel},
    URL = {https://hal.inria.fr/hal-01087067},
    JOURNAL = {{International Journal of Parallel Programming}},
    PUBLISHER = {{Springer Verlag (Germany)}},
    PAGES = {0},
    YEAR = {2014},
    MONTH = Nov,
    DOI = {10.1007/s10766-014-0338-1},
    KEYWORDS = {Task graphs, Scheduling, Memory, System on chip, Video decoding},
    HAL_ID = {hal-01087067},
    HAL_VERSION = {v1},
    
    }
    


Conference articles

  1. Marcelo Alaniz, Sergio Nesmachnow, Brice Goglin, Santiago Iturriaga, Veronica Gil Costa, and Marcela Printista. MBSPDiscover: An Automatic Benchmark for MultiBSP Performance Analysis. In Springer, editor, First HPCLATAM - CLCAR Joint Latin American High Performance Computing Conference, Communications in Computer and Information Science (CCIS), Valparaiso, Chili, October 2014. Note: STIC-AmSud SEHLOC. [WWW] [PDF]
    @inproceedings{alaniz:hal-01062528,
    hal_id = {hal-01062528},
    url = {http://hal.inria.fr/hal-01062528},
    title = {{MBSPDiscover: An Automatic Benchmark for MultiBSP Performance Analysis}},
    author = {Alaniz, Marcelo and Nesmachnow, Sergio and Goglin, Brice and Iturriaga, Santiago and Gil Costa, Veronica and Printista, Marcela},
    abstract = {{Multi-Bulk Synchronous Parallel (MultiBSP) is a recently proposed parallel programming model for multicore machines that extends the classic BSP model. MultiBSP is very useful to design algorithms and estimate their running time, which are hard to do in High Performance Computing applications. For a correct estimation of the running time, the main parameters of the MultiBSP model for different multicore architectures need to be determined. This article presents a benchmark proposal for measuring the parameters that characterize the communication and synchronization cost for the model. Our approach discovers automatically the hierarchical structure of the multicore architecture by using a specific tool (hwloc) that allows obtaining runtime information about the machine. We describe the design, implementation and the results of benchmarking two multicore machines. Furthermore, we report the validation of the proposed method by using a real MultiBSP implementation of the vector inner product algorithm and comparing the predicted execution time against the real execution time.}},
    language = {Anglais},
    affiliation = {LIDIC , CeCal - High Performance Computing , RUNTIME - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI},
    booktitle = {{First HPCLATAM - CLCAR Joint Latin American High Performance Computing Conference}},
    address = {Valparaiso, Chili},
    editor = {Springer },
    series = {Communications in Computer and Information Science (CCIS) },
    note = {STIC-AmSud SEHLOC },
    audience = {internationale },
    year = {2014},
    month = Oct,
    pdf = {http://hal.inria.fr/hal-01062528/PDF/MultiBSP-Benchmark.pdf} 
    }
    


  2. Alexandre Denis. pioman: a Generic Framework for Asynchronous Progression and Multithreaded Communications. In IEEE International Conference on Cluster Computing (IEEE Cluster), Madrid, Espagne, September 2014. [WWW] [PDF] Keyword(s): Madeleine, MPI, PIOMan.
    @inproceedings{denis:hal-01064652,
    hal_id = {hal-01064652},
    url = {http://hal.inria.fr/hal-01064652},
    title = {{pioman: a Generic Framework for Asynchronous Progression and Multithreaded Communications}},
    author = {Denis, Alexandre},
    abstract = {{Recent cluster architectures include dozens of cores per node, with all cores sharing the network resources. To program such architectures, hybrid models mixing MPI+threads, and in particular MPI+OpenMP are gaining popularity. This imposes new requirements on communication libraries, such as the need for MPI\_THREAD\_MULTIPLE level of multi-threading support. Moreover, the high number of cores brings new opportunities to parallelize communication libraries, so as to have proper background progression of communication and communication/computation overlap. In this paper, we present pioman, a generic framework to be used by MPI implementations, that brings seamless asynchronous progression of communication by opportunistically using available cores. It uses system threads and thus is composable with any runtime system used for multithreading. Through various benchmarks, we demonstrate that our 	exttt{pioman}-based MPI implementation exhibits very good properties regarding overlap, progression, and multithreading, and outperforms state-of-art MPI implementations.}},
    language = {Anglais},
    affiliation = {Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{IEEE International Conference on Cluster Computing (IEEE Cluster)}},
    address = {Madrid, Espagne},
    audience = {internationale },
    year = {2014},
    month = Sep,
    pdf = {http://hal.inria.fr/hal-01064652/PDF/finale-short.pdf},
    keywords = {Madeleine, MPI, PIOMan} 
    }
    


  3. Brice Goglin. Managing the Topology of Heterogeneous Cluster Nodes with Hardware Locality (hwloc). In Proceedings of 2014 International Conference on High Performance Computing & Simulation (HPCS 2014), Bologna, Italy, July 2014. [WWW] Keyword(s): hwloc.
    @InProceedings{ Gog14hwloc,
    author = {Brice Goglin},
    title = {{Managing the Topology of Heterogeneous Cluster Nodes with Hardware Locality (hwloc)}},
    booktitle = {Proceedings of 2014 International Conference on High Performance Computing \& Simulation (HPCS 2014)},
    year = 2014,
    month = JUL,
    address = {Bologna, Italy},
    url = {http://hal.inria.fr/hal-00985096},
    keywords = {hwloc} 
    }
    


  4. Brice Goglin, Joshua Hursey, and Jeffrey M. Squyres. netloc: Towards a Comprehensive View of the HPC System Topology. In Proceedings of the fifth International Workshop on Parallel Software Tools and Tool Infrastructures (PSTI 2014), held in conjunction with ICPP-2014, Minneapolis, MN, September 2014. [WWW] Keyword(s): hwloc.
    @InProceedings{ GogHurSqu14psti,
    author = {Brice Goglin and Joshua Hursey and Jeffrey M. Squyres},
    title = {{netloc: Towards a Comprehensive View of the HPC System Topology}},
    booktitle = {Proceedings of the fifth International Workshop on Parallel Software Tools and Tool Infrastructures (PSTI 2014), held in conjunction with ICPP-2014},
    year = 2014,
    month = SEP,
    address = {Minneapolis, MN},
    url = {http://hal.inria.fr/hal-01010599},
    keywords = {hwloc} 
    }
    


  5. Sylvain Henry, Alexandre Denis, Denis Barthou, Marie-Christine Counilh, and Raymond Namyst. Toward OpenCL Automatic Multi-Device Support. In Fernando Silva, Ines Dutra, and Vitor Santos Costa, editors, Euro-Par 2014, Porto, Portugal, August 2014. Springer. [WWW] [PDF] Keyword(s): StarPU.
    @inproceedings{sylvain:hal-01005765,
    hal_id = {hal-01005765},
    url = {http://hal.inria.fr/hal-01005765},
    title = {{Toward OpenCL Automatic Multi-Device Support}},
    author = {Henry, Sylvain, and Denis, Alexandre and Barthou, Denis and Counilh, Marie-Christine and Namyst, Raymond},
    language = {Anglais},
    affiliation = {Exascale Computing Research Laboratory , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{Euro-Par 2014}},
    publisher = {Springer},
    address = {Porto, Portugal},
    editor = {Fernando Silva and Ines Dutra and Vitor Santos Costa },
    audience = {internationale },
    year = {2014},
    month = Aug,
    pdf = {http://hal.inria.fr/hal-01005765/PDF/final.pdf},
    keywords = {StarPU} 
    }
    


  6. Xavier Lacoste, Mathieu Faverge, Pierre Ramet, Samuel Thibault, and George Bosilca. Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes. In HCW'2014 workshop of IPDPS, Phoenix, États-Unis, May 2014. IEEE. Note: RR-8446 RR-8446. [WWW] [PDF] Keyword(s): StarPU.
    @inproceedings{lacoste:hal-00987094,
    hal_id = {hal-00987094},
    url = {http://hal.inria.fr/hal-00987094},
    title = {{Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes}},
    author = {Lacoste, Xavier and Faverge, Mathieu and Ramet, Pierre and Thibault, Samuel and Bosilca, George},
    keywords = {StarPU},
    language = {Anglais},
    affiliation = {HiePACS - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest , Innovative Computing Laboratory - ICL},
    booktitle = {{HCW'2014 workshop of IPDPS}},
    publisher = {IEEE},
    address = {Phoenix, {\'E}tats-Unis},
    note = {RR-8446 RR-8446 },
    audience = {internationale },
    year = {2014},
    month = May,
    pdf = {http://hal.inria.fr/hal-00987094/PDF/sparsegpus.pdf},
    
    }
    


  7. Bertrand Putigny, Brice Goglin, and Denis Barthou. A Benchmark-based Performance Model for Memory-bound HPC Applications. In Proceedings of 2014 International Conference on High Performance Computing & Simulation (HPCS 2014), Bologna, Italy, July 2014. [WWW]
    @InProceedings{ PutGogBar14hpcs,
    author = {Bertrand Putigny and Brice Goglin and Denis Barthou},
    title = {{A Benchmark-based Performance Model for Memory-bound HPC Applications}},
    booktitle = {Proceedings of 2014 International Conference on High Performance Computing \& Simulation (HPCS 2014)},
    year = 2014,
    month = JUL,
    address = {Bologna, Italy},
    url = {http://hal.inria.fr/hal-00985598} 
    }
    


  8. Bertrand Putigny, Benoit Ruelle, and Brice Goglin. Analysis of MPI Shared-Memory Communication Performance from a Cache Coherence Perspective. In Proceedings of the 15th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2014), held in conjunction with IPDPS, Phoenix, AZ, May 2014. [WWW]
    @Inproceedings{ PutRueGog14pdsec,
    author = {Bertrand Putigny and Benoit Ruelle and Brice Goglin},
    title = {{Analysis of MPI Shared-Memory Communication Performance from a Cache Coherence Perspective}},
    booktitle = {Proceedings of the 15th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2014), held in conjunction with IPDPS},
    year = 2014,
    month = MAY,
    address = {Phoenix, AZ},
    url = {http://hal.inria.fr/hal-00956307} 
    }
    


  9. Marc Sergent and Simon Archipoff. Modulariser les ordonnanceurs de tâches : une approche structurelle. In Compas'2014, Neuchâtel, Suisse, April 2014. [WWW] [PDF] Keyword(s): StarPU.
    @inproceedings{sergent:hal-00978364,
    hal_id = {hal-00978364},
    url = {http://hal.inria.fr/hal-00978364},
    title = {{Modulariser les ordonnanceurs de t{\^a}ches : une approche structurelle}},
    author = {Sergent, Marc and Archipoff, Simon},
    keywords = {StarPU},
    language = {Fran{\c c}ais},
    affiliation = {RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{Compas'2014}},
    address = {Neuch{\^a}tel, Suisse},
    audience = {nationale },
    year = {2014},
    month = Apr,
    pdf = {http://hal.inria.fr/hal-00978364/PDF/ordonnanceurs\_modulaires.pdf} 
    }
    


  10. Luka Stanisic, Samuel Thibault, Arnaud Legrand, Brice Videau, and Jean-François Méhaut. Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-Core Architectures. In Euro-par - 20th International Conference on Parallel Processing, Porto, Portugal, August 2014. Springer-Verlag. [WWW] [PDF] Keyword(s): StarPU.
    @inproceedings{stanisic:hal-01011633,
    hal_id = {hal-01011633},
    url = {http://hal.inria.fr/hal-01011633},
    title = {{Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-Core Architectures}},
    author = {Stanisic, Luka and Thibault, Samuel and Legrand, Arnaud and Videau, Brice and M{\'e}haut, Jean-Fran{\c c}ois},
    language = {Anglais},
    affiliation = {MESCAL - INRIA Grenoble Rh{\^o}ne-Alpes / LIG laboratoire d'Informatique de Grenoble , Laboratoire d'Informatique de Grenoble - LIG , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{Euro-par - 20th International Conference on Parallel Processing}},
    publisher = {Springer-Verlag},
    address = {Porto, Portugal},
    audience = {internationale },
    year = {2014},
    month = Aug,
    pdf = {http://hal.inria.fr/hal-01011633/PDF/StarPUSG\_article.pdf},
    keywords = {StarPU} 
    }
    


  11. Philippe Virouleau, Pierrick BRUNET, François Broquedis, Nathalie Furmento, Samuel Thibault, Olivier Aumage, and Thierry Gautier. Evaluation of OpenMP Dependent Tasks with the KASTORS Benchmark Suite. In 10th International Workshop on OpenMP, IWOMP2014, 10th International Workshop on OpenMP, IWOMP2014, Salvador, Brazil, France, pages 16 - 29, September 2014. Springer. [WWW] [doi:10.1007/978-3-319-11454-5_2] Keyword(s): OpenMP, task dependencies, benchmarks, runtime systems, KASTORS, StarPU.
    @inproceedings{virouleau:hal-01081974,
    TITLE = {{Evaluation of OpenMP Dependent Tasks with the KASTORS Benchmark Suite}},
    AUTHOR = {Virouleau, Philippe and BRUNET, Pierrick and Broquedis, Fran{\c c}ois and Furmento, Nathalie and Thibault, Samuel and Aumage, Olivier and Gautier, Thierry},
    URL = {https://hal.inria.fr/hal-01081974},
    BOOKTITLE = {{10th International Workshop on OpenMP, IWOMP2014}},
    ADDRESS = {Salvador, Brazil, France},
    PUBLISHER = {{Springer}},
    SERIES = {10th International Workshop on OpenMP, IWOMP2014},
    PAGES = {16 - 29},
    YEAR = {2014},
    MONTH = Sep,
    DOI = {10.1007/978-3-319-11454-5\_2},
    KEYWORDS = {OpenMP, task dependencies, benchmarks, runtime systems, KASTORS, StarPU},
    HAL_ID = {hal-01081974},
    HAL_VERSION = {v1},
    
    }
    


Internal reports

  1. Cédric Augonnet, Olivier Aumage, Nathalie Furmento, Samuel Thibault, and Raymond Namyst. StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators. Rapport de recherche RR-8538, INRIA, May 2014. [WWW] [PDF] Keyword(s): StarPU.
    @techreport{augonnet:hal-00992208,
    hal_id = {hal-00992208},
    url = {http://hal.inria.fr/hal-00992208},
    title = {{StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators}},
    author = {Augonnet, C{\'e}dric and Aumage, Olivier and Furmento, Nathalie and Thibault, Samuel and Namyst, Raymond},
    language = {Anglais},
    affiliation = {RUNTIME - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI},
    type = {Rapport de recherche},
    institution = {INRIA},
    number = {RR-8538},
    year = {2014},
    month = May,
    pdf = {http://hal.inria.fr/hal-00992208/PDF/RR-8538.pdf},
    keywords = {StarPU} 
    }
    


  2. Xavier Lacoste, Mathieu Faverge, Pierre Ramet, Samuel Thibault, and George Bosilca. Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes. Rapport de recherche RR-8446, INRIA, January 2014. [WWW] [PDF] Keyword(s): StarPU.
    @techreport{lacoste:hal-00925017,
    hal_id = {hal-00925017},
    url = {http://hal.inria.fr/hal-00925017},
    title = {{Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes}},
    author = {Xavier Lacoste and Mathieu Faverge and Pierre Ramet and Samuel Thibault and George Bosilca},
    keywords = {StarPU},
    language = {Anglais},
    affiliation = {HiePACS - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest , Innovative Computing Laboratory - ICL},
    pages = {25},
    type = {Rapport de recherche},
    institution = {INRIA},
    number = {RR-8446},
    year = {2014},
    month = Jan,
    pdf = {http://hal.inria.fr/hal-00925017/PDF/RR-8446.pdf} 
    }
    


Miscellaneous

  1. Emmanuel Agullo, Olivier Aumage, Mathieu Faverge, Nathalie Furmento, Florent Pruvost, Marc Sergent, and Samuel Thibault. Overview of Distributed Linear Algebra on Hybrid Nodes over the StarPU Runtime. SIAM Conference on Parallel Processing for Scientific Computing, February 2014. [WWW] [PDF] Keyword(s): StarPU.
    @misc{sergent:hal-00978602,
    hal_id = {hal-00978602},
    url = {http://hal.inria.fr/hal-00978602},
    title = {{Overview of Distributed Linear Algebra on Hybrid Nodes over the StarPU Runtime}},
    author = {Agullo, Emmanuel and Aumage, Olivier and Faverge, Mathieu and Furmento, Nathalie and Pruvost, Florent and Sergent, Marc and Thibault, Samuel},
    language = {Anglais},
    affiliation = {RUNTIME - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI , HiePACS - INRIA Bordeaux - Sud-Ouest},
    howpublished = {{SIAM Conference on Parallel Processing for Scientific Computing}},
    address = {Portland, Oregon, {\'E}tats-Unis},
    audience = {internationale },
    year = {2014},
    month = Feb,
    pdf = {http://hal.inria.fr/hal-00978602/PDF/siampp14.pdf},
    keywords = {StarPU} 
    }
    


  2. Adrian Tate, Amir Kamil, Anshu Dubey, Armin Grösslinger, Brad Chamberlain, Brice Goglin, H. Carter Edwards, Chris J. Newburn, David Padua, Didem Unat, Emmanuel Jeannot, Frank Hannig, Tobias Gysi, Hatem Ltaief, James Sexton, Jesus Labarta, John Shalf, Karl Fürlinger, Kathryn O’Brien, Leonidas Linardakis, Maciej Besta, Marie-Christine Sawley, Mark Abraham, Mauro Bianco, Miquel Pericàs, Naoya Maruyama, Paul H. J. Kelly, Peter Messmer, Robert B. Ross, Romain Cledat, Satoshi Matsuoka, Thomas Schulthess, Torsten Hoefler, and Vitus J. Leung. Programming Abstractions for Data Locality, November 2014.
    @misc{ PADAL14,
    author = {{Tate}, {Adrian} and {Kamil}, {Amir} and {Dubey}, {Anshu} and {Gr{\"o}{\ss}linger}, {Armin} and {Chamberlain}, {Brad} and {Goglin}, {Brice} and {Edwards}, {H. Carter} and {Newburn}, {Chris J.} and {Padua}, {David} and {Unat}, {Didem} and {Jeannot}, {Emmanuel} and {Hannig}, {Frank} and {Gysi}, {Tobias} and {Ltaief}, {Hatem} and {Sexton}, {James} and {Labarta}, {Jesus} and {Shalf}, {John} and {F{\"u}rlinger}, {Karl} and {O’Brien}, {Kathryn} and {Linardakis}, {Leonidas} and {Besta}, {Maciej} and {Sawley}, {Marie-Christine} and {Abraham}, {Mark} and {Bianco}, {Mauro} and {Peric{\`a}s}, {Miquel} and {Maruyama}, {Naoya} and {Kelly}, {Paul H. J.} and {Messmer}, {Peter} and {Ross}, {Robert B.} and {Cledat}, {Romain} and {Matsuoka}, {Satoshi} and {Schulthess}, {Thomas} and {Hoefler}, {Torsten} and {Leung}, {Vitus J.}},
    title = {Programming Abstractions for Data Locality},
    institution = {PADAL Workshop 2014, April 28--29, Swiss National Supercomputing Center (CSCS), Lugano, Switzerland},
    editor = {{Unat}, {Didem} and {Shalf}, {John} and {Hoefler}, {Torsten} and {Schulthess}, {Thomas}},
    pagetotal = {54},
    type = {White Paper},
    year = 2014,
    month = NOV 
    }
    



BACK TO INDEX




Disclaimer:

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All person copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Les documents contenus dans ces répertoires sont rendus disponibles par les auteurs qui y ont contribué en vue d'assurer la diffusion à temps de travaux savants et techniques sur une base non-commerciale. Les droits de copie et autres droits sont gardés par les auteurs et par les détenteurs du copyright, en dépit du fait qu'ils présentent ici leurs travaux sous forme électronique. Les personnes copiant ces informations doivent adhérer aux termes et contraintes couverts par le copyright de chaque auteur. Ces travaux ne peuvent pas être rendus disponibles ailleurs sans la permission explicite du détenteur du copyright.




Last modified: Tue Oct 24 15:09:39 2017
Author: sthibault.


This document was translated from BibTEX by bibtex2html