After some research, I find 3 articles about these topic to clearly clarify how function is calling. http://unixwiz.net/techtips/win32-callconv-asm.html http://www.installsetupconfig.com/win32programming/processtoolhelpapis12_1.html http://joshcarter.com/books/pragprowrimo_2009/functions_and_parameters ne of the "big picture" issues in looking at compiled C code is the function-calling conventions. These are the methods that a calling function and a called function agree on how parameters and return values should be passed between them, and how the stack is used by the function itself. The layout of the stack constitutes the "stack frame", and knowing how this works can go a long way to decoding how something works. In C and modern CPU design conventions, the stack frame is a chunk of memory, allocated from the stack, at run-time, each time a function is called, to store its automatic variables. Hence nested or recursive calls to the same function, each successively obtain their own separate frames. Physically, a function's stack frame is the area between the addresses contained in esp, the stack pointer, and ebp, the frame pointer (base pointer in Intel terminology). Thus, if a function pushes more values onto the stack, it is effectively growing its frame. This is a very low-level view: the picture as seen from the C/C++ programmer is illustrated elsewhere: • Unixwiz.net Tech Tip: Intel x86 Function-call Conventions - C Programmer's View For the sake of discussion, we're using the terms that the Microsoft Visual C compiler uses to describe these conventions, even though other platforms may use other terms. It's important to note that these are merely conventions, and any collection of cooperating code can agree on nearly anything. There are other conventions (passing parameters in registers, for instance) that behave differently, and of course the optimizer can make mincemeat of any clear picture as well. Our focus here is to provide an overview, and not an authoritative definition for these conventions. In both __cdecl and __stdcall conventions, the same set of three registers is involved in the function-call frame: Virtually everybody in the Intel assembler world uses the Intel notation, but the GNU C compiler uses what they call the "AT&T syntax" for backwards compatibility. This seems to us to be a really dumb idea, but it's a fact of life. There are minor notational differences between the two notations, but by far the most annoying is that the AT&T syntax reverses the source and destination operands. To move the immediate value 4 into the EAX register: More recent GNU compilers have a way to generate the Intel syntax, but it's not clear if the GNU assembler takes it. In any case, we'll use the Intel notation exclusively. There are other minor differences that are not of much concern to the reverse engineer. The best way to understand the stack organization is to see each step in calling a function with the __cdecl conventions. These steps are taken automatically by the compiler, and though not all of them are used in every case (sometimes no parameters, sometimes no local variables, sometimes no saved registers), but this shows the overall mechanism employed. The __stdcall convention is mainly used by the Windows API, and it's a bit more compact than __cdecl. The main difference is that any given function has a hard-coded set of parameters, and this cannot vary from call to call like it can in C (no "variadic functions"). Because the size of the parameter block is fixed, the burden of cleaning these parameters off the stack can be shifted to the called function, instead of being done by the calling function as in __cdecl. There are several effects of this: As an offshoot of #2, Microsoft Visual C takes special care of functions that are B{__stdcall}. Since the number of parameters is known at compile time, the compiler encodes the parameter byte count in the symbol name itself, and this means that calling the function wrong leads to a link error. For instance, the function int foo(int a, int b) would generate — at the assembler level — the symbol "_foo@8", where "8" is the number of bytes expected. This means that not only will a call with 1 or 3 parameters not resolve (due to the size mismatch), but neither will a call expecting the __cdecl parameters (which looks for _foo). It's a clever mechanism that avoids a lot of problems. The x86 architecture provides a number of built-in mechanisms for assisting with frame management, but they don't seem to be commonly used by C compilers. Of particular interest is theENTER instruction, which handles most of the function-prolog code. We're pretty sure these are functionally equivalent, but our 80386 processor reference suggests that the ENTER version is more compact (6 bytes -vs- 9) but slower (15 clocks -vs- 6). The newer processors are probably harder to pin down, but somebody has probably figured out that ENTER is slower. Sigh.Register use in the stack frame
*--ESP = value; // push
value = *ESP++; // pop
Assembler notation
mov $4, %eax // AT&T notation
mov eax, 4 // Intel notation
Calling a __cdecl function
push ebp
mov ebp, esp // ebp « esp
16(%ebp)
- third function parameter
12(%ebp)
- second function parameter
8(%ebp)
- first function parameter
4(%ebp)
- old %EIP (the function's "return address")
0(%ebp)
- old %EBP (previous function's base pointer)
-4(%ebp)
- first local variable
-8(%ebp)
- second local variable
-12(%ebp)
- third local variable
__cdecl -vs- __stdcall
Variations and Notes
ENTER 10,0 PUSH ebp
MOV ebp, esp
SUB esp, 10
发表评论
-
ffmpeg compile
2010-11-11 16:09 1067http://www.defendingthetruth.co ... -
Open port 80
2010-11-07 11:12 908vi /etc/sysconfig/iptables ... -
Install fast cgi & nginx in fedora
2010-11-07 10:52 1060http://www.cyberciti.biz/faq/rh ... -
Presentation Notes of workshop
2009-11-07 16:18 0they felt that the industry in ... -
化蛋abc
2009-06-12 17:12 0这是两张来自于澳洲的最新ATB 化蛋图片,我们仍然没有他的具体 ... -
Exchange consultant
2007-06-29 15:01 1075Today Exchange consultant: Ques ... -
To do list
2007-03-14 12:36 8681. Let everyone know what's SQA ... -
What will you consider, if you're the director...
2007-03-09 21:27 1115What will you consider, if you' ... -
Tips from the R&D Top Manager
2007-02-27 14:22 806Although there are most concept ... -
Prepare the process definition
2007-02-14 15:07 825Lesson1: Minor preparation you ...
相关推荐
push ebp mov ebp,esp
"反汇编代码中EBP和ESP的关系" 在高级语言反汇编之后的汇编代码中,寄存器EBP的作用是一个非常重要的主题。本文将对Intel体系结构下的函数调用与栈变化的关系进行详细的分析。 在Intel体系结构下,函数调用过程中...
- 文中提到的“推广的ESP定律”进一步扩展了ESP定律的应用范围,即使ESP值在某些情况下发生变化,也可以通过跟踪EBP值来定位OEP。 #### ESP定律的局限性与EBP的妙用 尽管ESP定律在许多情况下都非常有效,但在某些...
[pwn]ROP:通过ESP和EBP间接控制EIP-附件资源
本文将深入探讨一个Ring3级别的反作弊策略,具体是通过VC++利用EBP寄存器遍历调用栈并识别模块名。Ring3是操作系统保护模式下的用户模式,其中运行的是普通应用程序,而反作弊机制则需要在此环境中监测可疑行为。 ...
EBP寄存器在函数调用时保存了当前帧的基地址,也就是上一个函数的EBP值,这使得我们能够追溯到调用链的上一层。它像一个指针,指向堆栈帧的顶部,堆栈帧是函数执行时在堆栈上分配的一块内存区域,包含了局部变量、...
mov ebp,esp 保存堆栈指针 mov eax,[ebp + 8H] 堆栈中 ebp 指向位置之前依次保存有 ebp,cs:eip,a,b,ebp +8 指向 a add eax,[ebp + 0CH] 堆栈中 ebp + 12 处保存了 b mov esp,ebp 恢复 esp pop ebp ret 8 stdcall ...
* leal -8(%ebp), %esp:将寄存器ebp的值减去8,并将结果存储到esp寄存器中,以便在函数调用中使用。 (c)编译器的处理方式 编译器在按值传递结构变量时,会将结构体变量的所有成员复制到栈上,然后将栈指针传递给...
- `prolog`:保存当前的EBP,并更新EBP指向ESP。 - `epilog`:恢复ESP到EBP的位置,并弹出EBP。 通过以上分析,我们可以了解到80x86汇编语言在32位环境下的基本指令集、寄存器使用规则、内存模型以及函数调用约定等...
逆向错误分析的源程序,用VC2005实现!
根据给定的文件信息,以下是对"SAP SRM EBP"相关知识点的详细解析: ### SAP SRM(Supplier Relationship Management)概述 SAP SRM,全称供应商关系管理,是SAP公司提供的一套用于优化企业与供应商之间业务流程的...
新手学习基础 要知道数据类型是什么!!!排除ebp esp尽量找eax edi
《基于期望传播(EBP)的反向传播算法在Matlab中的实现》 在机器学习领域,反向传播算法(Backpropagation Algorithm)是训练神经网络最常用的优化方法之一,而期望传播(Expectation Backpropagation, EBP)是对...
This is a version of mistletoe maintained by the Excutable Book Project (EBP). It tracks the myst branch of ExecutableBookProject/mistletoe which eventually it is hoped will be merged into mistletoe ...
在C#编程中,我们通常不直接涉及到汇编级别的操作,如"Push ebp"和"mov ebp, esp",因为这些指令是属于低级语言,如x86汇编的一部分。然而,理解这些指令对于深入理解CPU的工作原理和内存管理至关重要,尤其是在底层...