18 References

Cai, Linjin, Yi-Chao Wang, William Tang, Bei Wang, Stephane Ethier, Zhao Liu, and James Lin. 2018. “OpenACC vs the Native Programming on Sunway TaihuLight: A Case Study with GTC-P.” In 2018 IEEE International Conference on Cluster Computing (CLUSTER), 88–97. IEEE.
Lin, James, Minhua Wen, Delong Meng, Xin Liu, Akira Nukada, and Satoshi Matsuoka. 2018. “Optimizing Preconditioned Conjugate Gradient on TaihuLight for OpenFOAM.” In 2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID), 273–82. IEEE.
Lin, James, Zhigeng Xu, Linjin Cai, Akira Nukada, and Satoshi Matsuoka. 2018. “Evaluating the SW26010 Many-Core Processor with a Micro-Benchmark Suite for Performance Optimizations.” Parallel Computing 77: 128–43.
Lin, James, Zhigeng Xu, Akira Nukada, Naoya Maruyama, and Satoshi Matsuoka. 2017. “Optimizations of Two Compute-Bound Scientific Kernels on the SW26010 Many-Core Processor.” In 2017 46th International Conference on Parallel Processing (ICPP), 432–41. IEEE.
Wang, Yi-Chao, Jin-Kun Chen, Bin-Rui Li, Si-Cheng Zuo, William Tang, Bei Wang, Qiu-Cheng Liao, Rui Xie, and James Lin. 2019. “An Empirical Study of HPC Workloads on Huawei Kunpeng 916 Processor.” In 2019 IEEE 25th International Conference on Parallel and Distributed Systems (ICPADS), 360–67. https://doi.org/10.1109/ICPADS47876.2019.00057.
Wei, Yueming, Yichao Wang, Linjin Cai, William Tang, Bei Wang, Stephane Ethier, Simon See, and James Lin. 2016. “Performance and Portability Studies with OpenACC Accelerated Version of GTC-P.” In 2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), 13–18. IEEE.
Wu, Kaiyue, Jianwen Wei, and James Lin. 2022. “SchedP: I/O-Aware Job Scheduling in Large-Scale Production HPC Systems.” In IFIP International Conference on Network and Parallel Computing, 315–26. Springer.
Zhang, Chenchen, MinHua Wen, Bin Zhang, James Lin, and Hong Liu. 2022. “A Load-Decoupling Parallel Strategy Based on Shared Memory Architecture for DSMC to Simulate Near-Continuum Gases.” Computer Physics Communications 279: 108466.
廖秋承, 左思成, 王一超, and 林新华. 2024. “处理器性能波动检测的计时方法及评价指标.” 计算机学报 47: 456–472.
张劼, 文敏华, 林新华, 孟德龙, and 陆豪. 2018. “基于历史模拟法的风险价值算法在GPU上的实现和优化.” 计算机科学 45: 291–294, 321.
文敏华, 林新华, and See Simon. 2013a. “动态网格的DSMC方法在GPU上的并行.” 计算机科学与探索 7: 472–479.
文敏华, 林新华, and See Simon. 2013b. “基于NVIDIA Kepler的PIC方法并行.” 计算机工程与科学 35: 100–104.
文敏华, 汪申鹏, 韦建文, 李林颖, 张斌, and 林新华. 2021. “基于DGX-2的湍流燃烧问题优化研究.” 计算机科学 48: 43–48.
武海鹏, 文敏华, See Simon, and 林新华. 2018. “激光等离子体相互作用模拟的并行和加速研究.” 计算机科学与探索 12: 550–558.
王一超, 王鎏振, and 林新华. 2022. “利用深度学习的硬件计数器复用估计算法.” 国防科技大学学报 44: 114–123.
王一超, 秦强, 施忠伟, and 林新华. 2015. “在Intel Knights Corner和NVIDIA Kepler架构上OpenACC的性能可移植性分析.” 计算机科学 42: 75–78.
王一超, 胡航, Tang William, 王蓓, and 林新华. 2020. “使用GTC-P应用评估曙光E级原型机的性能.” 计算机工程与科学 42: 1–7.
王一超, and 韦建文. 2017. “基于高性能计算平台的TensorFlow应用探索与实践.” 实验室研究与探索 36: 125–128.
韦建文, 许志耿, 王丙强, See Simon, and 林新华. 2017. “异构集群上的宏基因组聚类优化.” 计算机科学 44: 20–22, 47.