当前位置：和泉文库 > 计算机 > 浏览文档

中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Optimizing existing large codebase-booklet

1 Measuring Performance What is performance ? Tools available Finding bottlenecks 2 Code modernization 3 Improving Memory Handling Context Containers and memory Container reservation Detecting offending code 4 The nightmare of thread safety Context and constraints Identifying problems Solving problems Thread contention 5 Low level optimizations Scope and target How to measure ? Improving Vectorization promises 6 Conclusion

文件格式：PDF，文件大小：1.17MB，售价：14.4元

共58页，可试读20页，点击往前阅读 ↑↑

文档详细内容（约58页）

Optimizing existing large codebase Measuire Modemise Mem threads Finding bottlenecks Understand where we can improve o analyze each part of the software o in order to find out where most time is spent o and understand whether it can be improved Most usual bottlenecks From biggest to lowest impact(usually) 。10 ●Memory 。Parallelization o Low level behavior:vectorization,cache behavior,high CPl erf tool bottlenecks 12/62 S.Ponce-CERN

Optimizing existing large codebase 12 / 62 S. Ponce - CERN Measure Modernize Mem threads low level c/c perf tools bottlenecks Finding bottlenecks Understand where we can improve analyze each part of the software in order to find out where most time is spent and understand whether it can be improved Most usual bottlenecks From biggest to lowest impact (usually) IO Memory Parallelization Low level behavior : vectorization, cache behavior, high CPI

Optimizing existing large codebase Measiare Modernice Mem threads low Make use of latest C++features oC++has evolved dramatically between 2010 and now o three new versions:C++11,C++14,C++17 o a LOT of new features targeting performance ●move semantic threading library variadic templates vectorization coming o converting existing code may already brings speed see Danilo's course for technical details o see my extended C++course if you're not at ease with the language 14/62 S.Ponce-CERN

Optimizing existing large codebase 14 / 62 S. Ponce - CERN Measure Modernize Mem threads low level c/c Make use of latest C++features C ++has evolved dramatically between 2010 and now three new versions : C++11, C++14, C++17 a LOT of new features targeting performance move semantic threading library variadic templates vectorization coming ! converting existing code may already brings speed see Danilo’s course for technical details see my extended C++course if you’re not at ease with the language

Optimizing existing large codebase Measire Modernice Mem threads lom Cleanup your code o While reviewing the code for converting to C++17: ●drop unused code 。drop unnecessary code e.g.do I really need to sort by hits here drop too generic APls if they are finally not needed replace virtual inheritance with templating when possible o consider dropping use of unmaintained libraries o It is very often surprising how much you gain there 15/62 S.Ponce-CERN

Optimizing existing large codebase 15 / 62 S. Ponce - CERN Measure Modernize Mem threads low level c/c Cleanup your code While reviewing the code for converting to C++17 : drop unused code drop unnecessary code e.g. do I really need to sort by hits here ? drop too generic APIs if they are finally not needed replace virtual inheritance with templating when possible consider dropping use of unmaintained libraries It is very often surprising how much you gain there

Optimizing existing large codebase 4 Mem threads Evolution of memory in the past decades CPU 60径yer “oore'sLaw 10国 Due to Moore's law in Processor-Memory Performance Gap: the 80s and 90s,there (grows50年/3r) 0 is a gap between CPU -DRAM 7第/小e亚 and memory DRAM performances 家委墨鉴墓鉴墨鉴墨墨鑫玉墨屋…青昌 Year Consequences o access to memory is now extremely slow(relatively) o level of caches have been introduced to mitigate good usage of caches has become a key parameter context comtainers reserving findBadCode 17/62 S.Ponce-CERN

Optimizing existing large codebase 17 / 62 S. Ponce - CERN Measure Modernize Mem threads low level c/c context containers reserving findBadCode Evolution of memory in the past decades Due to Moore’s law in the 80s and 90s, there is a gap between CPU and memory performances Consequences : access to memory is now extremely slow (relatively) level of caches have been introduced to mitigate good usage of caches has become a key parameter

Optimizing existing large codebase Mem threads To Typical cache structure size latency Ll data L1 64 kB 4 cycles instruction L2 Cache 256kB 10 cycles L3 Cache 10 MB 40 cycles DRAM 64 GB 400 cycles Typical data,on an Haswell architecture context comtainers reserving findBadCode 18/62 S.Ponce-CERN

Optimizing existing large codebase 18 / 62 S. Ponce - CERN Measure Modernize Mem threads low level c/c context containers reserving findBadCode Typical cache structure L1 data L1 instruction L2 Cache L3 Cache DRAM size latency 64 kB 4 cycles 256 kB 10 cycles 10 MB 40 cycles 64 GB 400 cycles Typical data, on an Haswell architecture

点击进入文档下载页（PDF格式）

共58页，可试读20页，点击继续阅读 ↓↓

您可能感兴趣的文档

中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Optimizing existing large codebase-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Preserving data-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Many ways to store data-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Many ways to store data-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Structuring data for efficient I/O-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Structuring data for efficient I/O-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Optimizing existing large codebase-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Optimizing existing large codebase-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Modern programming languages for HEP-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Modern programming languages for HEP-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Practical vectorization-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Practical vectorization-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Preserving data-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Key ingredients to achieve effective I/O-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Key ingredients to achieve effective I/O-booklet
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Data storage and preservation-pres
中国科学院：CERN专题计算学校《T-CSC数据存储》课程教学资源（讲义）Data storage and preservation-booklet
西安电子科技大学：《数学图像处理 Digital Image Processing Digital Image Processing》课程教学资源（授课教案）第1章绪论（许录平）
西安电子科技大学：《数学图像处理 Digital Image Processing Digital Image Processing》课程教学资源（授课教案）第2章数字图像处理基础
西安电子科技大学：《数学图像处理 Digital Image Processing Digital Image Processing》课程教学资源（授课教案）第3章图像变换
西安电子科技大学：《数学图像处理 Digital Image Processing Digital Image Processing》课程教学资源（授课教案）第4章图像增强
西安电子科技大学：《数学图像处理 Digital Image Processing Digital Image Processing》课程教学资源（授课教案）第5章图象恢复
西安电子科技大学：《数学图像处理 Digital Image Processing Digital Image Processing》课程教学资源（授课教案）第6章图像压缩编码
西安电子科技大学：《数学图像处理 Digital Image Processing Digital Image Processing》课程教学资源（授课教案）第7章图像分割

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录