
Clarification: Java has only pass-by-value; there is no pass-by-reference!

   Posted: 2008-07-13
OO
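  Preface: An earlier post in the "Java Interview Questions Demystified" series (part 5, "Was a value passed or a reference?") raises the question "Is all parameter passing in Java pass-by-value? Is there any pass-by-reference?" and ends up with:
Quote
In the end we reach the following conclusions:

   1. When primitive types and variables of primitive type are passed to a method as arguments, they are passed by value. Inside the method body, the original variable cannot be reassigned, nor can its value be changed.
   2. When objects and reference-type variables are passed to a method as arguments, they are passed by reference. Inside the method body, the original variable cannot be reassigned, but the properties of the object it points to can be changed.

  Quite a few people hold this view, but the conclusion is not entirely correct. The correct statement is: Java has only pass-by-value; there is no pass-by-reference.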

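  Put simply, this is really a question of what "pass by reference" means.
  If you write a method like this:
swap(Type arg1, Type arg2) {
    Type temp = arg1;
    arg1 = arg2;
    arg2 = temp;
}


  and call it like this:
Type var1 = ...;
Type var2 = ...;
swap(var1,var2);

  and the call really does exchange the values of var1 and var2, only then can the language possibly be "pass by reference".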

  I won't go into further explanation here; instead, here are two good articles:
  ☆  Does Java pass by reference or pass by value?
  ☆  Java is Pass-by-Value, Dammit!

  The full text of the second one follows, in the original English:

Introduction

I finally decided to write up a little something about Java's parameter passing. I'm really tired of hearing folks (incorrectly) state "primitives are passed by value, objects are passed by reference".

I'm a compiler guy at heart. The terms "pass-by-value" semantics and "pass-by-reference" semantics have very precise definitions, and they're often horribly abused when folks talk about Java. I want to correct that... The following is how I'd describe these:

Pass-by-value
    The actual parameter (or argument expression) is fully evaluated and the resulting value is copied into a location being used to hold the formal parameter's value during method/function execution. That location is typically a chunk of memory on the runtime stack for the application (which is how Java handles it), but other languages could choose parameter storage differently.
Pass-by-reference
    The formal parameter merely acts as an alias for the actual parameter. Anytime the method/function uses the formal parameter (for reading or writing), it is actually using the actual parameter.
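
As a small illustration of the pass-by-value definition above, here is a minimal Java sketch. The class and method names (`PrimitiveCopyDemo`, `increment`) are made up for this example and do not come from the article. The formal parameter `n` receives a copy of the argument's value, so assigning to it leaves the caller's variable alone.

```java
// Hypothetical demo class; all names here are illustrative assumptions.
class PrimitiveCopyDemo {
    // 'n' is a fresh local variable initialized with a copy of the argument's value.
    static int increment(int n) {
        n = n + 1;   // changes only the local copy
        return n;
    }

    public static void main(String[] args) {
        int counter = 41;
        int result = increment(counter);
        System.out.println(counter); // prints 41: the caller's variable is untouched
        System.out.println(result);  // prints 42
    }
}
```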

Java is strictly pass-by-value, exactly as in C. Read the Java Language Specification (JLS). It's spelled out, and it's correct. (See http://java.sun.com/docs/books/jls/second_edition/html/classes.doc.html#37472)

In short: Java has pointers and is strictly pass-by-value. There's no funky rules. It's simple, clean, and clear. (Well, as clear as the evil C++-like syntax will allow ;)

Note: See the note at the end of this article for the semantics of remote method invocation (RMI). What is typically called "pass by reference" for remote objects is actually incredibly bad semantics.
The Litmus Test

There's a simple "litmus test" for whether a language supports pass-by-reference semantics:

Can you write a traditional swap(a,b) method/function in the language?

A traditional swap method or function takes two arguments and swaps them such that variables passed into the function are changed outside the function. Its basic structure looks like

// NON-JAVA!
swap(Type arg1, Type arg2) {
    Type temp = arg1;
    arg1 = arg2;
    arg2 = temp;
}


If you can write such a method/function in your language such that calling

// NON-JAVA
Type var1 = ...;
Type var2 = ...;
swap(var1,var2);


actually switches the values of the variables var1 and var2, the language supports pass-by-reference semantics.

For example, in Pascal, you can write

{ Pascal }
procedure swap(var arg1, arg2: SomeType);
  var
    temp : SomeType;
  begin
    temp := arg1;
    arg1 := arg2;
    arg2 := temp;
  end;


...
{ in some other procedure/function/program }
var
  var1, var2 : SomeType;
begin
  var1 := ...;
  var2 := ...;
  swap(var1, var2);
end;


or in C++ you could write

// C++
void swap(SomeType& arg1, SomeType& arg2) {
  SomeType temp = arg1;
  arg1 = arg2;
  arg2 = temp;
}


...

SomeType var1 = ...;
SomeType var2 = ...;
swap(var1, var2); // swaps their values!




(Please let me know if my Pascal or C++ has lapsed and I've messed up the syntax...)

But you cannot do this in Java!
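
To make the litmus test concrete, here is a hedged Java sketch of what happens when the traditional swap is transcribed literally. `SwapDemo` and `trySwap` are names invented for this illustration, and `String` stands in for the article's `Type`.

```java
// Hypothetical demo class; the names are assumptions for illustration only.
class SwapDemo {
    // A literal transcription of the traditional swap.
    // 'a' and 'b' hold copies of the caller's references, so the
    // exchange happens only between these two local variables.
    static void swap(String a, String b) {
        String temp = a;
        a = b;
        b = temp;
    }

    // Calls swap and reports what the caller's two variables hold afterwards.
    static String[] trySwap(String first, String second) {
        swap(first, second);
        return new String[] { first, second };
    }

    public static void main(String[] args) {
        String[] after = trySwap("one", "two");
        System.out.println(after[0] + " " + after[1]); // prints "one two": nothing was swapped
    }
}
```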
Now the details...

The problem we're facing here is statements like

In Java, Objects are passed by reference, and primitives are passed by value.

This is half incorrect. Everyone can easily agree that primitives are passed by value; there's no such thing in Java as a pointer/reference to a primitive.

However, Objects are not passed by reference. A correct statement would be Object references are passed by value.

This may seem like splitting hairs, but it is far from it. There is a world of difference in meaning. The following examples should help make the distinction.

In Java, take the case of

  public void foo(Dog d) {
    d = new Dog("Fifi");
  }

  Dog aDog = new Dog("Max");
  foo(aDog);


the variable passed in (aDog) is not modified! After calling foo, aDog still points to the "Max" Dog!
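
The snippet above can be turned into a runnable sketch. The article never shows a `Dog` class, so the minimal one below (a name field with a getter and setter) is an assumption, as is the wrapper class `ReassignDemo`.

```java
// Hypothetical wrapper class; Dog is an assumed minimal implementation.
class ReassignDemo {
    static class Dog {
        private String name;
        Dog(String name) { this.name = name; }
        String getName() { return name; }
        void setName(String name) { this.name = name; }
    }

    // Rebinds only the local parameter 'd'; the caller's variable is not touched.
    static void foo(Dog d) {
        d = new Dog("Fifi");
    }

    public static void main(String[] args) {
        Dog aDog = new Dog("Max");
        Dog before = aDog;          // remember which object aDog referred to
        foo(aDog);
        System.out.println(aDog.getName());  // prints Max
        System.out.println(aDog == before);  // prints true: still the same object
    }
}
```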

Many people mistakenly think/state that something like

  public void foo(Dog d) { 
    d.setName("Fifi");
  }

shows that Java does in fact pass objects by reference.

The mistake they make is in the definition of

  Dog d;


itself. When you write

  Dog d;


you are defining a pointer to a Dog object, not a Dog object itself.

Calling

  foo(d);


passes the value of d to foo; it does not pass the object that d points to!
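
A sketch of why `d.setName("Fifi")` is visible to the caller even though only the pointer's value was passed. As before, the `Dog` class and the names `MutateDemo`, `rename`, and `renameThenRebind` are assumptions made up for this illustration.

```java
// Hypothetical wrapper class; Dog is an assumed minimal implementation.
class MutateDemo {
    static class Dog {
        private String name;
        Dog(String name) { this.name = name; }
        String getName() { return name; }
        void setName(String name) { this.name = name; }
    }

    // 'd' is a copy of the caller's pointer, so it reaches the same Dog on the heap.
    static void rename(Dog d) {
        d.setName("Fifi");
    }

    // After 'd' is rebound, later changes go to the new object only.
    static void renameThenRebind(Dog d) {
        d.setName("Fifi");     // visible to the caller: same object
        d = new Dog("Rex");    // rebinds the local copy
        d.setName("Rover");    // invisible to the caller: different object
    }

    public static void main(String[] args) {
        Dog first = new Dog("Max");
        rename(first);
        System.out.println(first.getName());  // prints Fifi

        Dog second = new Dog("Max");
        renameThenRebind(second);
        System.out.println(second.getName()); // prints Fifi, not Rover
    }
}
```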

The value of the pointer being passed is similar to a memory address. Under the covers it's a tad different, but you can think of it in exactly the same way. The value uniquely identifies some object on the heap.

The use of the word "reference" in Java was an incredibly poor choice (in my not-so-humble opinion...) Java has pointers, plain and simple. The designers of Java wanted to try to make a distinction between C/C++ pointers and Java pointers, so they picked another term. Under the covers, pointers are implemented very differently in Java and C/C++, and Java protects the pointer values, disallowing operations such as pointer arithmetic and invalid runtime casting.

However, it makes no difference how pointers are implemented under the covers. You program with them exactly the same way in Java as you would in C or C++. The syntax is just slightly different.

In Java,

  Dog d;   // Java


is exactly like C or C++'s

  Dog *d;  // C++


And using

  d.setName("Fifi");  // Java


is exactly like C++'s

  d->setName("Fifi"); // C++


To sum up: Java has pointers, and the value of the pointer is passed in. There's no way to actually pass an object itself as a parameter. You can only pass a pointer to an object.

Keep in mind, when you call

  foo(d);


you're not passing an object; you're passing a pointer to the object.

(For a slightly different (but still correct) take on this issue, please see http://www-106.ibm.com/developerworks/library/j-praxis/pr1.html. It's from Peter Haggar's excellent book, Practical Java.)


A Note on Remote Method Invocation (RMI)

When passing parameters to remote methods, things get a bit more complex. First, we're (usually) dealing with passing data between two independent virtual machines, which might be on separate physical machines as well. Passing the value of a pointer wouldn't do any good, as the target virtual machine doesn't have access to the caller's heap.

You'll often hear "pass by value" and "pass by reference" used with respect to RMI. These terms have more of a "logical" meaning, and really aren't correct for the intended use.

Here's what is usually meant by these phrases with regard to RMI. Note that this is not proper usage of "pass by value" and "pass by reference" semantics:

RMI Pass-by-value
    The actual parameter is serialized and passed using a network protocol to the target remote object. Serialization essentially "squeezes" the data out of an object/primitive. On the receiving end, that data is used to build a "clone" of the original object or primitive. Note that this process can be rather expensive if the actual parameters point to large objects (or large graphs of objects).
    This isn't quite the right use of "pass-by-value"; I think it should really be called something like "pass-by-memento". (See "Design Patterns" by Gamma et al for a description of the Memento pattern).
    
RMI Pass-by-reference
    The actual parameter, which is itself a remote object, is represented by a proxy. The proxy keeps track of where the actual parameter lives, and anytime the target method uses the formal parameter, another remote method invocation occurs to "call back" to the actual parameter. This can be useful if the actual parameter points to a large object (or graph of objects) and there are few call backs.
    This isn't quite the right use of "pass-by-reference" (again, you cannot change the actual parameter itself). I think it should be called something like "pass-by-proxy". (Again, see "Design Patterns" for descriptions of the Proxy pattern).
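   Posted: 2008-07-14
The OP's argument is a bit of a stretch. It is true that every call is pass-by-value, because the way the call stack works means we can only push values onto the stack. When a method returns, nothing further is done with the values on the stack; the stack-top pointer is simply adjusted, discarding the values that were pushed. So any change a method makes to a parameter that was passed in on the stack is invisible to the outside world.
Given this, languages handle the need to carry changes made to a parameter inside a function back out of it in different ways: in C/C++ you can pass a pointer, while Java passes a reference to the object by default. If the OP insists on calling the address pushed onto the stack at call time a "value", that is not wrong as such; I just find that way of putting it a bit of a stretch.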
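   Posted: 2008-07-14
When it comes down to it, this is a matter of how one understands the concept of "pass by reference".
If you insist on saying "pass by reference", you have to define a new concept that differs from "pass by reference" in C++.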
   Posted: 2008-07-14
The OP has misunderstood the C++ example. That swap(Type& arg1, Type& arg2) method swaps the contents that the two addresses arg1 and arg2 point to, not arg1 and arg2 themselves.
   Posted: 2008-07-14
Could we put it this way: all this disagreement arises because people have not reached a shared understanding of the concept "pass by reference"?
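   Posted: 2008-07-14
MarkDong wrote:
The OP has misunderstood the C++ example. That swap(Type& arg1, Type& arg2) method swaps the contents that the two addresses arg1 and arg2 point to, not arg1 and arg2 themselves.

To put it bluntly, what gets passed in is arg1's father rather than arg1 itself, and the father can have another son or simply adopt a new one. Java doesn't do that: after being passed in, the person may lose an arm or gain a leg, but it is still the same person. Have I understood this correctly?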
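   Posted: 2008-07-15
Types such as String and Integer in Java are immutable, so when such a "person" is passed into a method, you cannot swap out an arm or a leg as the previous poster described, because that person is immutable.
The Beans we write ourselves are different: they are mutable robots, and after a round of set calls inside a method they can be changed beyond recognition. Of course, you cannot replace robot A with robot B, but you can make robot A look the same as robot B.
Heh, that sounds a bit convoluted. Even if robot A's head, body, arms, and legs are all set to robot B's, robot A is still not robot B, because they occupy different memory.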
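
The robot analogy in this reply can be sketched in Java roughly as follows. The `Robot` bean with a single `head` property and the names `ImmutableVsBeanDemo`, `tryToChange`, and `makeLookLike` are all assumptions made up for this illustration.

```java
// Hypothetical demo class; Robot is an assumed one-property bean.
class ImmutableVsBeanDemo {
    static class Robot {
        private String head;
        Robot(String head) { this.head = head; }
        String getHead() { return head; }
        void setHead(String head) { this.head = head; }
    }

    // String and Integer have no mutators, so all this method can do is
    // rebind its local copies to newly created objects.
    static void tryToChange(String s, Integer n) {
        s = s + "!";   // creates a new String; the caller's String is unchanged
        n = n + 1;     // unboxes, adds, boxes a new Integer
    }

    // A mutable bean can be altered through the copied reference.
    static void makeLookLike(Robot target, Robot model) {
        target.setHead(model.getHead());
    }

    public static void main(String[] args) {
        String s = "hi";
        Integer n = 1;
        tryToChange(s, n);
        System.out.println(s + " " + n);  // prints "hi 1"

        Robot a = new Robot("round");
        Robot b = new Robot("square");
        makeLookLike(a, b);
        System.out.println(a.getHead());  // prints square: A now looks like B
        System.out.println(a == b);       // prints false: A is still not B
    }
}
```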
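   Posted: 2008-07-15
What exactly is a reference?
These conceptual things in Java give me the worst headaches. When I read about C++ everything is easy, but reading about Java is thoroughly frustrating.
Is a reference a pointer?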
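   Posted: 2008-07-15
When a reference is passed, what actually gets passed in is a copy of that reference.
A a = new A();
test(a);
is equivalent to
{
    A b = a;   // b is a copy of the reference held in a
    test(b);
}
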
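   Posted: 2008-07-15
There is nothing much to argue about here. Whatever you pass, what arrives is only a copy, and that copy is stored on the stack as a local variable of the method.
If you pass a primitive, modifying the value does not affect the variable that was passed in as the argument, because what you modify is the method's local variable, which is a copy.
If you pass a reference to an object, the same applies: it is also a copy, but the copy and the reference that was passed in as the argument point to the same object in memory, so you can operate on that object through the copy. If you modify the reference itself, for example by making it point to another object in memory, the reference that was originally passed in is unaffected.
I think understanding this much is enough. Calling it pass-by-value or pass-by-reference hardly matters, but pass-by-value is the better fit; the value can be either a reference or a primitive.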