JOSE CARLOS
CABALEIRO DOMINGUEZ
Catedrático de universidade
JUAN CARLOS
PICHEL CAMPOS
Profesor titular de universidade
Publicacións nas que colabora con JUAN CARLOS PICHEL CAMPOS (17)
2016
-
Boosting performance of a Statistical Machine Translation system using dynamic parallelism
Journal of Computational Science, Vol. 13, pp. 37-48
-
Power and Energy Implications of the Number of Threads Used on the Intel Xeon Phi
Annals of Multicore and GPU Programming: AMGP, Vol. 3, Núm. 1, pp. 55-65
2015
-
Power and energy implications of the number of threads used on the Intel Xeon Phi
Annals of Multicore and GPU Programming: AMGP, Vol. 2, Núm. 1, pp. 55-65
2014
-
3DyRM: a dynamic roofline model including memory latency information
Journal of Supercomputing, Vol. 70, Núm. 2, pp. 696-708
-
A hardware counter-based toolkit for the analysis of memory accesses in SMPs
Concurrency Computation Practice and Experience, Vol. 26, Núm. 6, pp. 1328-1341
-
Multiobjective optimization technique based on monitoring information to increase the performance of thread migration on multicores
2014 IEEE International Conference on Cluster Computing, CLUSTER 2014
-
Using an extended Roofline Model to understand data and thread affinities on NUMA systems
Annals of Multicore and GPU Programming: AMGP, Vol. 1, Núm. 1, pp. 56-67
2013
-
A flexible and dynamic page migration infrastructure based on hardware counters
Journal of Supercomputing, Vol. 65, Núm. 2, pp. 930-948
2012
-
A graphical tool for performance analysis of multicore systems based on the Roofline Model
Proceedings of the 2012 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2012
-
Hardware counters based analysis of memory accesses in SMPs
Proceedings of the 2012 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2012
2011
-
Analyzing the execution of sparse matrix-vector product on the Finisterrae SMP-NUMA system
Journal of Supercomputing, Vol. 58, Núm. 2, pp. 195-205
2010
-
Increasing the locality of iterative methods and its application to the simulation of semiconductor devices
International Journal of High Performance Computing Applications, Vol. 24, Núm. 2, pp. 136-153
2009
-
Increasing data reuse of sparse algebra codes on simultaneous multithreading architectures
Concurrency Computation Practice and Experience, Vol. 21, Núm. 15, pp. 1838-1856
2005
-
A new technique to reduce false sharing in parallel irregular codes based on distance functions
Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN
-
Performance optimization of irregular codes based on the combination of reordering and blocking techniques
Parallel Computing, Vol. 31, Núm. 8-9, pp. 858-876
2004
-
Improving the locality of the sparse matrix-vector product on shared memory multiprocessors
Proceedings - Euromicro Conference on Parellel, Distribeted and Network-based Proceeding
-
Mejora de la localidad en SMPs: el producto matriz dispersa-vector como caso de estudio
Computación de altas prestaciones: actas de las XV Jornadas de Paralelismo. Almería 15, 16 y 17 de septiembre de 2004