当前位置：和泉文库 > 计算机 > 浏览文档

中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Optimizing existing large codebase-booklet

1 Measuring Performance What is performance ? Tools available Finding bottlenecks 2 Code modernization 3 Improving Memory Handling Context Containers and memory Container reservation Detecting offending code 4 The nightmare of thread safety Context and constraints Identifying problems Solving problems Thread contention 5 Low level optimizations Scope and target How to measure ? Improving 6 Conclusion

文件格式：PDF，文件大小：487.7KB，售价：14.62元

共59页，可试读20页，点击往前阅读 ↑↑

文档详细内容（约59页）

Optimizing existing large codebase Mem threads Io Improving Memory Handling Measuring Performance Code modernization Improving Memory Handling o Context o Containers and memory o Container reservation o Detecting offending code The nightmare of thread safety Low level optimizations ®Conclusion context comtainera reserving findBadCode 16157 S.Ponce-CERN

Optimizing existing large codebase 16 / 57 S. Ponce - CERN Measure Modernize Mem threads low level c/c context containers reserving findBadCode Improving Memory Handling 1 Measuring Performance 2 Code modernization 3 Improving Memory Handling Context Containers and memory Container reservation Detecting offending code 4 The nightmare of thread safety 5 Low level optimizations 6 Conclusion

Optimizing existing large codebase 4 Mem threads Evolution of memory in the past decades CPU 60径yer “oore'sLaw 10国 Due to Moore's law in Processor-Memory Performance Gap: the 80s and 90s,there (grows50年/3r) 0 is a gap between CPU -DRAM 7第/小e亚 and memory DRAM performances 家委墨鉴墓鉴墨鉴墨墨鑫玉墨屋…青昌 Year Consequences o access to memory is now extremely slow(relatively) o level of caches have been introduced to mitigate good usage of caches has become a key parameter context comtainers reserving findBadCode 17/57 S.Ponce-CERN

Optimizing existing large codebase 17 / 57 S. Ponce - CERN Measure Modernize Mem threads low level c/c context containers reserving findBadCode Evolution of memory in the past decades Due to Moore’s law in the 80s and 90s, there is a gap between CPU and memory performances Consequences : access to memory is now extremely slow (relatively) level of caches have been introduced to mitigate good usage of caches has become a key parameter

Optimizing existing large codebase Mem threads To Typical cache structure size latency Ll data L1 64 kB 4 cycles instruction L2 Cache 256kB 10 cycles L3 Cache 10 MB 40 cycles DRAM 64 GB 400 cycles Typical data,on an Haswell architecture context comtainers reserving findBadCode 18/57 S.Ponce-CERN

Optimizing existing large codebase 18 / 57 S. Ponce - CERN Measure Modernize Mem threads low level c/c context containers reserving findBadCode Typical cache structure L1 data L1 instruction L2 Cache L3 Cache DRAM size latency 64 kB 4 cycles 256 kB 10 cycles 10 MB 40 cycles 64 GB 400 cycles Typical data, on an Haswell architecture

Optimizing existing large codebase Mem threads Practical consequence in C++ Guidelines o we want as few heap memory allocations as possible stack usage is much better o we want continuous memory blocks,specially for containers that means containers of objects,no pointers involved e.g.vector<Obj*>or array<vector<Obj>>are banned 2 main rules o use container of objects,not of pointers ouse (const)references everywhere avoid any unnecessary copy of data o including implicit ones o use container reservation context comtainers reserving findBadCode 19/57 S.Ponce-CERN

Optimizing existing large codebase 19 / 57 S. Ponce - CERN Measure Modernize Mem threads low level c/c context containers reserving findBadCode Practical consequence in C++ Guidelines we want as few heap memory allocations as possible stack usage is much better ! we want continuous memory blocks, specially for containers that means containers of objects, no pointers involved e.g. vector<Obj*> or array<vector<Obj>> are banned ! 2 main rules use container of objects, not of pointers use (const) references everywhere avoid any unnecessary copy of data including implicit ones use container reservation

Optimizing existing large codebase Mem threads Container of objects in memory Simple vector case std::vector<int>v; 知为 X3 Vector of objects struct A float x,y,z;}; std::vector<A>v; 20 Ao A A2 contert containers reserving findBadCode 20/57 S.Ponce-CERN

Optimizing existing large codebase 20 / 57 S. Ponce - CERN Measure Modernize Mem threads low level c/c context containers reserving findBadCode Container of objects in memory Simple vector case std::vector<int> v; x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 ... Vector of objects struct A { float x, y, z; }; std::vector<A> v; x0 y0 z0 A0 x1 y1 z1 A1 x2 y2 z2 A2 x2

点击进入文档下载页（PDF格式）

共59页，试读已结束，阅读完整版请下载

您可能感兴趣的文档

中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Optimizing existing large codebase-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Modern programming languages for HEP-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Modern programming languages for HEP-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Practical vectorization-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Practical vectorization-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Writing Parallel software（booklet）
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Writing Parallel software（pres）
中国科学院高能所计算中心：数据技术上机 Data Technologies – CERN School of Computing 2019
中国科学院高能所计算中心：数据技术课程 CSC 2018 Data Technologies Exercises（CSC DT 2018 Introduction）
中国科学院高能所计算中心：高能物理数据的存储和管理（汪璐）
南京大学：《数据结构 Data Structures》课程教学资源（PPT课件讲稿）第九章排序
南京大学：《数据结构 Data Structures》课程教学资源（PPT课件讲稿）第八章图
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Structuring data for efficient I/O-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Structuring data for efficient I/O-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Many ways to store data-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Many ways to store data-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Preserving data-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Optimizing existing large codebase-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Optimizing existing large codebase-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Preserving data-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Key ingredients to achieve effective I/O-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Key ingredients to achieve effective I/O-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Data storage and preservation-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Data storage and preservation-booklet

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录