`
mryufeng
  • 浏览: 982486 次
  • 性别: Icon_minigender_1
  • 来自: 广州
社区版块
存档分类
最新评论

Hunting Bugs

阅读更多
转自:http://www.erlangatwork.com/2008/07/hunting-bugs.html

Our Erlang gateways were developed and deployed in phases starting with AIM/ICQ, GTalk, Yahoo, and finally MSN. Aside from minor protocol implementation bugs there were no problems and we were very satisfied with stability and performance. However, not long after releasing the MSN gateway we noticed that it used a ton of memory, and also periodically suffered massive spikes in memory use which invoked the Linux kernel's OOM killer. This lead to a crash course in debugging running Erlang apps, and a great appreciation for the years of real-world lessons that have influenced the features and design of Erlang/OTP.

The first thing I looked into is why the gateway would eventually use several gigabytes of memory with only a few hundred users online. Since each Erlang process has its own heap, I started by looking for which processes were using the most memory. erlang:processes/0 returns a list of all running processes, and erlang:process_info/1 provides a ton of information about a process including heap use, stack size, etc. So I wrote a quick script to dump the process info of all processes to a file, sorted by total memory use. This was run on the live gateway instance.

It turned out that only a few active MSN sessions were using the majority of the heap, and these sessions were for users with very large contact lists. After initial login, one session could be using > 1GB of heap.

Newer versions of the MSNP protocol use SOAP requests to get authorization tokens, contact lists, allow/block lists, etc. My initial implementation was very simple, using inets to submit the HTTP request, reading the full response body as a list, and then parsing that list with xmerl. These responses could be very large and since the gateway was running on a 64bit Erlang VM, each character would occupy 16 bytes of memory. xmerl's representation of an XML document also requires quite a bit of storage. A simple XML document such as:


<a><b>foo</b><c/></a>

is represented as:


{xmlElement,a,a,[],
             {xmlNamespace,[],[]},
             [],1,[],
             [{xmlElement,b,b,[],
                          {xmlNamespace,[],[]},
                          [{a,1}],
                          1,[],
                          [{xmlText,[{b,1},{a,1}],1,[],"foo",text}],
                          [],"/tmp/",undeclared},
              {xmlElement,c,c,[],
                          {xmlNamespace,[],[]},
                          [{a,1}],
                          2,[],[],[],undefined,undeclared}],
             [],"/tmp/",undeclared}



So I rewrote my SOAP module to use the streaming method http:request/4 which returns the HTTP response as a series of binary chunks. xmerl doesn't support parsing binaries so I switched to erlsom, which does, and also converted the XML to a very simple and compact format:


{a,[],
[{b,[],[<<"foo">>]},
  {c,[],[]}]}



After making these changes the amount of memory used per login decreased by 2.5-3x. However the gateway was still occasionally using up all available memory and dying at what appeared to be random intervals. My best guess was that something in the protocol stream was triggering this problem so I updated the gateway to log each login attempt, and ran tcpdump to capture all MSN traffic. Eventually I was able to correlate the crashes with incoming status text messages from certain contacts of a few heysan users.

MSNP transports status text as an XML payload of the UBX command:


<Data><CurrentMedia></CurrentMedia><PSM>status text</PSM></Data>


I was still using xmerl to parse this small XML document and grab the cdata from the <PSM> tag. The status text of some contacts contained combinations of UTF-8 text and numeric unicode entities such as &#x3A;. Simply attempting to parse these small XML documents would cause xmerl to allocate more than 8GB of memory and thus kill the emulator. Parsing the UBX payload with erlsom instead of xmerl completely resolved the problem, but was a bit of a letdown after so much time spent hunting hunting such an esoteric bug.

UPDATE: the crash described above is fixed in xmerl-1.1.10, which is included in Erlang/OTP R12B-4.

要善于erlang的基础设施 事半功倍!
分享到:
评论

相关推荐

    Microsoft Press Hunting Security Bugs

    Offering practical advice, hands-on guidance and code samples, this essential guide will help you to find, classify, and assess security bugs before your software is released.

    Microsoft.Press.Hunting.Security.Bugs chm

    Microsoft.Press.Hunting.Security.Bugs chm Microsoft.Press.Hunting.Security.Bugs chm

    hunting security bugs

    《Hunting Security Bugs》这本书是IT安全领域的一部经典之作,专注于探讨如何发现并消除系统中的安全漏洞。这本书深入浅出地介绍了网络安全检测的技术、方法和策略,旨在帮助读者提升网络安全防护能力,防止黑客...

    iOS.Application.Security

    Eliminating security holes in iOS apps is ... Whether you're looking to bolster your app's defenses or hunting bugs in other people's code, iOS Application Security will help you get the job done well.

    mwri-bug-hunting-with-static-code-analysis-bsides-2016

    Bug Hunting with Static Code Analysis Nick Jones 6 th June 2016 ++ Bug Hunting with Static Code Analysis + Software developers make mistakes + Mistakes = bugs = vulnerabilities + Our goal is fewer ...

    The C# Player’s Guide, 3rd Edition

    Learn to control the tools and tricks of programming in C#, including the .NET framework, dealing with compiler errors, and hunting down bugs in your program. Master the needed skills by taking on a ...

    A look at the Samsung Shannon Baseband——Marco Grassi.pdf

    在“Hunting For Bugs”部分,可能会讨论如何识别和利用这些漏洞,以及如何通过“Advanced” Debugging技术来更有效地分析基带处理器的行为。 总之,这份报告为读者提供了一次深入了解三星Shannon基带处理器的机会...

    Linux Kernel Configuration Option Reference

    请注意,在提交错误报告之前,应先阅读内核源码目录下的`README`、`MAINTAINERS`、`REPORTING-BUGS`、`Documentation/BUG-HUNTING` 和 `Documentation/oops-tracing.txt`等文档。 此外,此选项还使过时的驱动程序...

    基于ssm+mysql高校就业管理系统设计与实现.docx

    In terms of the long-term development of society, college graduates' job hunting becomes more convenient, leading to better prospects. Through this website, everyone can apply for company positions ...

Global site tag (gtag.js) - Google Analytics