selao久久国产精品,精品一卡一卡高清在线观看

本文用off-cpu火焰圖分析一個(gè)程序的延遲(主要在拿鎖上)，找出來(lái)瓶頸，并消除的故事。本文非常值得一讀，但是閱碼場(chǎng)沒(méi)有足夠的時(shí)間將其翻譯為中文，希望童鞋們直接讀英文。

The Setup

As a perf ormance engineer at MemSQL, one of my primary responsibilities is to ensure that customer Proof of Concepts (POCs) run smoothly. I was recently asked to assist with a big POC, where I was surprised to encounter an uncommon Linux performance issue. I was running a synthetic workload of 16 threads (one for each CPU core). Each one simultaneously executed a very simple query (select count(*) from t where i > 5) against a columnstore table.

In theory, this ought to be a CPU bound operation since it would be reading from a file that was already in disk buffer cache. In practice, our cores were spending about 50% of their time idle

In this post, I’ll walk through some of the debugging techniques and reveal exactly how we reached resolution.

What were our threads doing?

After confirming that our workload was indeed using 16 threads, I looked at the state of our various threads. In every refresh of myhtopwindow, I saw that a handful of threads were in theDstate corresponding to “Uninterruptible sleep”:

Why were we going off CPU?

At this point, I generated anoff-cpu flamegraphusing Linuxperf_eventsto see why we entered this state.Off-CPUmeans that instead of looking at what is keeping the CPU busy, you look at what is preventing it from being busy by things happening elsewhere (e.g. waiting for IO or a lock). The normal way to generate these visualizations is to useperf inject -s, but the machine I tested on did not have a new enough version ofperf. Instead I had to use anawkscriptI had previously written:

$ sudoperfrecord --call-graph=fp -e 'sched:sched_switch' -e 'sched:sched_stat_sleep' -e 'sched:sched_stat_blocked' --pid $(pgrep memsqld | head -n 1) -- sleep 1

[ perf record: Woken up 1 times to write data ]

[ perf record: Captured and wrote 1.343 MB perf.data (~58684 samples) ]

$ sudoperfscript -f time,comm,pid,tid,event,ip,sym,dso,trace -i sched.data | ~/FlameGraph/stackcollapse-perf-sched.awk | ~/FlameGraph/flamegraph.pl --color=io --countname=us >off-cpu.svg

Note: recording scheduler events viaperf recordcan have a very large overhead and should be used cautiously in production environments. This is why I wrap theperf recordaround asleep 1to limit the duration.

In an off-cpu flamegraph, the width of a bar is proportional to the total time spent off cpu. Here we see a lot of time is spent inrwsem_down_write_failed.

From the repeated calls torwsem_down_read_failedandrwsem_down_write_failed, we see that culprit wasmmapcontending in the kernel on themm->mmap_semlock:

down_write(&mm->mmap_sem);

ret = do_mmap_pgoff(file, addr, len, prot, flag, pgoff,&populate);

up_write(&mm->mmap_sem);

This was causing everymmapsyscall to take 10-20ms (almost half the latency of the query itself). MemSQL was so fast that that we had inadvertently written a benchmark for Linuxmmap!

The fix was simple — we switched from usingmmapto using the traditional filereadinterface. After this change, we nearly doubled our throughput and became CPU bound as we expected:

For more information and discussion around Linux performance,check out the original post on my personal blog.

Download MemSQL Community Edition to run your own performance tests for free today:memsql.com/download

Alex Reece is a systems and performance engineer. He believes in active benchmarking, root cause analysis, and fast code.

聲明：本文內(nèi)容及配圖由入駐作者撰寫或者入駐合作網(wǎng)站授權(quán)轉(zhuǎn)載。文章觀點(diǎn)僅代表作者本人，不代表電子發(fā)燒友網(wǎng)立場(chǎng)。文章及其配圖僅供工程師學(xué)習(xí)之用，如有內(nèi)容侵權(quán)或者其他違規(guī)問(wèn)題，請(qǐng)聯(lián)系本站處理。舉報(bào)投訴

cpu

cpu

+關(guān)注

關(guān)注
68

文章
10870

瀏覽量
211896
Linux

Linux

+關(guān)注

關(guān)注
87

文章
11310

瀏覽量
209614
SQL

SQL

+關(guān)注

關(guān)注
1

文章
764

瀏覽量
44155

原文標(biāo)題：用off-cpu火焰圖調(diào)查L(zhǎng)inux性能問(wèn)題

文章出處：【微信號(hào)：LinuxDev，微信公眾號(hào)：Linux閱碼場(chǎng)】歡迎添加關(guān)注！文章轉(zhuǎn)載請(qǐng)注明出處。

評(píng)論

相關(guān)推薦

Linux性能分析工具大全

今天浩道跟大家分享關(guān)于linux性能分析過(guò)程中常用到的分析工具！

發(fā)表于 01-05 09:52 ?609次閱讀

中國(guó)鋰離子電池原材料市場(chǎng)調(diào)查分析報(bào)告2008-2009版

中國(guó)鋰離子電池原材料市場(chǎng)調(diào)查分析報(bào)告2008-2009版詳細(xì)內(nèi)容請(qǐng)見:http://www.boomingfield.com/Html/yjxxcl/2008-9/18

發(fā)表于 12-29 15:12

_首屆中國(guó)嵌入式應(yīng)用狀況_調(diào)查分析報(bào)告

發(fā)表于 08-20 14:48

全志Tina中使用perf分析CPU使用率

perf簡(jiǎn)介Perf是是內(nèi)置于Linux內(nèi)核源碼樹中的性能剖析(profiling)工具。不僅可以用于應(yīng)用程序的性能統(tǒng)計(jì)分析，還可以用于內(nèi)核的性能

發(fā)表于 05-20 14:25

火焰識(shí)別

本人長(zhǎng)期從事Linux系統(tǒng)的圖像處理產(chǎn)品研發(fā)，近期在做火焰識(shí)別，火爐溫度控制，智能精準(zhǔn)滅火，最近在用樹莓派，期待本產(chǎn)品有更好的性能，我希望可以有機(jī)會(huì)試用該開發(fā)版，體驗(yàn)新產(chǎn)品的強(qiáng)大功能，同時(shí)及時(shí)反饋?zhàn)约旱挠脩趔w驗(yàn)，使雙方共贏。

發(fā)表于 07-23 10:18

CPU核心工作性能

CPU核心工作性能 CPU核心概述

發(fā)表于 12-17 10:59 ?340次閱讀

Linux CPU的性能應(yīng)該如何優(yōu)化

在Linux系統(tǒng)中，由于成本的限制，往往會(huì)存在資源上的不足，例如 CPU、內(nèi)存、網(wǎng)絡(luò)、IO 性能。本文，就對(duì) Linux 進(jìn)程和 CPU 的

發(fā)表于 01-18 08:52 ?3387次閱讀

疫情之下，中國(guó)LED顯示屏市場(chǎng)活力調(diào)查分析

疫情之下，中國(guó)LED顯示屏市場(chǎng)活力調(diào)查分析 眾所周知市場(chǎng)活躍是行業(yè)發(fā)展的主要?jiǎng)恿?，?020年初突如其來(lái)的疫情，給中國(guó)市場(chǎng)帶來(lái)了巨大的沖擊，LED顯示屏市場(chǎng)也不例外。而我們收到了行業(yè)多方面的市場(chǎng)反饋

發(fā)表于 04-02 11:23 ?1914次閱讀

火焰圖系列之使用火焰圖隱藏功能提高繪制精度

我們可以看到，火焰圖顯示， func程序占用了近四分之一的CPU時(shí)間。但是由于我們把 func綁定在CPU0和1上執(zhí)行，根據(jù)小學(xué)數(shù)學(xué)我們應(yīng)該可以計(jì)算出來(lái) func最多占用 2/32=6

發(fā)表于 06-23 10:15 ?2051次閱讀

火焰圖：全局視野的Linux性能剖析

CPU火焰圖中的每一個(gè)方框是一個(gè)函數(shù)，方框的長(zhǎng)度，代表了它的執(zhí)行時(shí)間，所以越寬的函數(shù)，執(zhí)行越久。火焰圖的樓層每高一層，就是更深一級(jí)的函數(shù)被調(diào)用，最頂層的函數(shù)，是葉子函數(shù)。

發(fā)表于 06-28 09:44 ?2060次閱讀

殺手級(jí)分析——bootchart

之前小弟一直在宣傳推廣火焰圖，結(jié)果是很多童鞋凡事都用火焰圖。說(shuō)實(shí)話，火焰圖特別適合

發(fā)表于 09-08 09:13 ?7639次閱讀

基于linux eBPF的進(jìn)程off-cpu的方法

的swap等。如下圖所示，紅色部分屬于on-cpu部分，藍(lán)色部分屬于off-cpu。一般我們用的perf命令等都是采樣on-cpu的指令進(jìn)行CPU的消耗

發(fā)表于 09-25 15:41 ?3119次閱讀

Linux問(wèn)題分析與性能優(yōu)化

文章來(lái)源于：https://mp.weixin.qq.com/s/d1NLXGp7teOgskussBXNMQ作者：alex目錄排查順序方法論性能分析工具CPU分析思路內(nèi)存

發(fā)表于 09-06 19:01 ?902次閱讀

Linux問(wèn)題故障定位的小技巧

a. on-CPU：執(zhí)行中，執(zhí)行中的時(shí)間通常又分為用戶態(tài)時(shí)間user和系統(tǒng)態(tài)時(shí)間sys。 b. off-CPU：等待下一輪上CPU，或者等待I/O、鎖、換頁(yè)等等，其狀態(tài)可以細(xì)分為可執(zhí)行、匿名換頁(yè)、睡眠、鎖、空閑等狀態(tài)。

發(fā)表于 07-09 16:30 ?421次閱讀

使用Arthas火焰圖工具的Java應(yīng)用性能分析和優(yōu)化經(jīng)驗(yàn)

分享作者在使用Arthas火焰圖工具進(jìn)行Java應(yīng)用性能分析和優(yōu)化的經(jīng)驗(yàn)。

發(fā)表于 10-28 09:27 ?280次閱讀