MySQL: the differences between IN, NOT IN, EXISTS, and NOT EXISTS

sanniangmiao

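In MySQL, IN and NOT IN give NULL no special treatment: a comparison with NULL evaluates to unknown rather than to true or false. As a result, if the subquery's result set contains a NULL, a NOT IN query always returns 0 rows, and IN never matches a row through a NULL.

EXISTS and NOT EXISTS do cope with NULL, because they only test whether the subquery returns any row.

EXISTS does not say which column's value falls inside the subquery's result; it only asks whether the statement that follows EXISTS returns any rows. As soon as there is a row, the condition holds for the current row of the outer query. It means "there exists", it introduces the subquery of a nested query, and it returns no data, only the logical value true or false. The select list of a subquery introduced by EXISTS is usually * (NULL works too): since the subquery only yields true or false, naming columns has no practical meaning.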
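Keys to the performance difference:
#1 Order of execution
Which table is the driving table: which query runs first and which runs second.
#2 The execution process
The advantage of EXISTS is that it returns as soon as one matching row is found, so it may well not need to scan the whole table.
IN has to scan the entire table and return the result set.
So when the subquery's table is small, scanning all of it or only part of it makes little difference; on a large table, EXISTS has the advantage.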
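Consider these two statements:
-- The subquery performs the complete correlation and returns every matching city_id
select * from areas where id in   (select city_id from deals where deals.city_id = areas.id);

-- The correlation in the subquery is the same, but the subquery returns as soon as it finds one row, so it tends to be more efficient

select * from areas where exists (select null     from deals where deals.city_id = areas.id);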
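
As a quick check that the two statements select the same rows, here is a minimal sketch that runs both against an in-memory SQLite database through Python's sqlite3 module. SQLite stands in for MySQL here, and the sample rows and the name column are assumptions made up for the illustration; the sketch says nothing about relative speed.

```python
import sqlite3

# In-memory database with hypothetical sample data (not from the article).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE areas (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE deals (id INTEGER PRIMARY KEY, city_id INTEGER);
    INSERT INTO areas VALUES (1, 'A'), (2, 'B'), (3, 'C');
    INSERT INTO deals VALUES (10, 1), (11, 1), (12, 3);
""")

# Correlated IN: the subquery yields every matching city_id.
in_rows = conn.execute(
    "SELECT id FROM areas WHERE id IN "
    "(SELECT city_id FROM deals WHERE deals.city_id = areas.id) ORDER BY id"
).fetchall()

# Correlated EXISTS: only the presence of a matching row matters.
exists_rows = conn.execute(
    "SELECT id FROM areas WHERE EXISTS "
    "(SELECT NULL FROM deals WHERE deals.city_id = areas.id) ORDER BY id"
).fetchall()

print(in_rows, exists_rows)
```

Area 2 has no deals, so both queries return areas 1 and 3. Any performance difference depends on the engine's plan, which is worth inspecting with EXPLAIN on the real database.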

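#3 The subquery's result
EXISTS only checks whether the subquery returns a result; which rows or columns it returns does not matter.
IN needs the values the subquery returns, because the outer query compares against them.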

Differences in how IN and EXISTS are used
1.
How EXISTS executes
select * from t1 where exists ( select null from t2 where y = x )
can be understood as:
    for x in ( select * from t1 )
    loop
       if ( exists ( select null from t2 where y = x.x ) )
       then
          OUTPUT THE RECORD
       end if
    end loop
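
The loop above can be mimicked in ordinary code. The following is a rough sketch, assuming t1 is a list of dicts with an x key and t2 is reduced to a list of its y values, with None standing in for NULL; the function name exists_filter is made up for the illustration and this is a conceptual model, not how any engine is implemented.

```python
def exists_filter(t1_rows, t2_ys):
    """Model: SELECT * FROM t1 WHERE EXISTS (SELECT NULL FROM t2 WHERE y = x)."""
    kept = []
    for row in t1_rows:              # the outer table drives the loop
        x = row["x"]
        # A NULL x can never satisfy y = x, so skip it explicitly.
        # any() stops at the first matching y, like EXISTS does.
        if x is not None and any(y == x for y in t2_ys):
            kept.append(row)         # OUTPUT THE RECORD
    return kept


t1 = [{"x": 1}, {"x": 2}, {"x": 3}, {"x": None}]
t2_ys = [3, 1, None]
kept = exists_filter(t1, t2_ys)
print(kept)
```

Only the rows with x = 1 and x = 3 survive; the None entries never match, which mirrors how a NULL comparison fails to produce a row for EXISTS.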
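Performance of IN versus EXISTS:
   If the subquery produces few rows and the outer table is large and indexed, use IN. Conversely, if the outer query has few rows and the subquery's table is large and indexed, use EXISTS.
   What really separates IN from EXISTS is the change in driving order (this is the key to the performance difference). With EXISTS the outer table is the driving table and is accessed first; with IN the subquery runs first. The goal is for the driving table to return quickly, which is why indexes and result-set sizes have to be weighed.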

Also, IN gives NULL no special treatment.
For example, this returns no rows:
select 1 from dual where null in (0,1,2,null)
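
To see the three-valued logic behind this, the sketch below evaluates a few IN expressions in SQLite via Python's sqlite3 module. SQLite is an assumption standing in for MySQL (it needs no dual table), and the helper name scalar is made up; Python shows SQL NULL as None.

```python
import sqlite3

conn = sqlite3.connect(":memory:")


def scalar(sql):
    """Return the single value produced by a one-row, one-column query."""
    return conn.execute(sql).fetchone()[0]


match = scalar("SELECT 1 IN (0, 1, 2, NULL)")        # a real match: true
no_match = scalar("SELECT 3 IN (0, 1, 2, NULL)")     # no match, list has NULL: unknown
null_lhs = scalar("SELECT NULL IN (0, 1, 2, NULL)")  # NULL on the left: unknown

# In a WHERE clause, unknown filters the row out just like false.
rows = conn.execute("SELECT 1 WHERE NULL IN (0, 1, 2, NULL)").fetchall()

print(match, no_match, null_lhs, rows)
```

The NULL in the list never counts as a match, even for a NULL on the left-hand side, which is why the query returns an empty result.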

2. NOT IN versus NOT EXISTS:
How NOT EXISTS executes
select .....
   from rollup R
where not exists ( select 'Found' from title T
                              where R.source_id = T.Title_ID);
can be understood as:
for x in ( select * from rollup )
       loop
           if ( not exists ( that query ) ) then
                  OUTPUT
           end if;
       end loop;

Note: NOT EXISTS and NOT IN are not fully interchangeable; it depends on the requirement. If the selected column can be NULL, one cannot replace the other.

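For example, compare the following statements:
select x,y from t;
x               y
------          ------
1               3
3               1
1               2
1               1
3               1
5               NULL
select * from t where   x not in (select y from t t2   )
no rows

select * from t where   not exists (select null from t t2
                                     where t2.y=t.x )
x        y
------   ------
5        NULL
The NOT IN query returns nothing because y contains a NULL, so the choice has to follow the actual requirement.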
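
The example can be reproduced with a short script. This is a minimal sketch using SQLite through Python's sqlite3 module as a stand-in for MySQL (an assumption; the NULL semantics shown are standard SQL), loading the six rows listed above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t (x INTEGER, y INTEGER);
    INSERT INTO t VALUES (1, 3), (3, 1), (1, 2), (1, 1), (3, 1), (5, NULL);
""")

# y contains a NULL, so "x NOT IN (...)" is never true for any row.
not_in_rows = conn.execute(
    "SELECT * FROM t WHERE x NOT IN (SELECT y FROM t t2)"
).fetchall()

# NOT EXISTS only asks whether a row with t2.y = t.x exists.
not_exists_rows = conn.execute(
    "SELECT * FROM t WHERE NOT EXISTS (SELECT NULL FROM t t2 WHERE t2.y = t.x)"
).fetchall()

print(not_in_rows)
print(not_exists_rows)
```

NOT IN yields no rows at all, while NOT EXISTS yields the single row (5, NULL), shown by Python as (5, None).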

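Performance of NOT IN versus NOT EXISTS:
    Use NOT IN only when the column after the subquery's SELECT keyword has a NOT NULL constraint, or the query otherwise implies it cannot be NULL. If the outer table is large and the subquery's table is small but yields many rows, use NOT IN together with an anti hash join.
   If the outer table has few rows and the subquery's table has many rows and an index, NOT EXISTS is suitable. NOT IN is also best combined with the /*+ HASH_AJ */ hint (an Oracle optimizer hint), or rewritten as an outer join plus IS NULL.
NOT IN does comparatively well under the cost-based optimizer.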

For example:
select .....
from rollup R
where not exists ( select 'Found' from title T
                            where R.source_id = T.Title_ID);

can be rewritten as (better; (+) is Oracle's outer-join syntax)

select ......
from title T, rollup R
where R.source_id = T.Title_id(+)
     and T.Title_id is null;

or as (better)
select /*+ HASH_AJ */ ...
         from rollup R
         where R.source_id NOT IN ( select T.Title_ID
                                      from title T
                                     where T.Title_ID IS NOT NULL )
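
The three anti-join forms can be compared on a toy data set. The sketch below is an assumption-laden illustration: it uses SQLite through Python's sqlite3 module, writes the outer join in ANSI LEFT JOIN syntax instead of Oracle's (+), omits the Oracle-only hint, and invents the sample rows. It also assumes source_id is never NULL, since a NULL source_id would be kept by NOT EXISTS but dropped by NOT IN.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE "rollup" (source_id INTEGER NOT NULL);
    CREATE TABLE title (title_id INTEGER);
    INSERT INTO "rollup" VALUES (1), (2), (3);
    INSERT INTO title VALUES (1), (NULL);
""")


def ids(sql):
    """Run a one-column query and return its values as a list."""
    return [row[0] for row in conn.execute(sql)]


not_exists = ids('''
    SELECT r.source_id FROM "rollup" r
    WHERE NOT EXISTS (SELECT 1 FROM title t WHERE t.title_id = r.source_id)
    ORDER BY r.source_id''')

left_join = ids('''
    SELECT r.source_id FROM "rollup" r
    LEFT JOIN title t ON t.title_id = r.source_id
    WHERE t.title_id IS NULL
    ORDER BY r.source_id''')

not_in_filtered = ids('''
    SELECT r.source_id FROM "rollup" r
    WHERE r.source_id NOT IN
          (SELECT title_id FROM title WHERE title_id IS NOT NULL)
    ORDER BY r.source_id''')

# Without the IS NOT NULL filter, the NULL title_id empties the result.
not_in_unfiltered = ids('''
    SELECT r.source_id FROM "rollup" r
    WHERE r.source_id NOT IN (SELECT title_id FROM title)''')

print(not_exists, left_join, not_in_filtered, not_in_unfiltered)
```

The first three queries agree on source_id 2 and 3. The last one shows why the IS NOT NULL filter in the subquery matters.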

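Questions and answers

Question 1:

-- The users table has 1000 rows; id is auto-increment, and every id is greater than 0

select * from users where exists (select * from users limit 0); -- how many rows are returned?

select * from users where exists (select * from users where id < 0); -- how many rows are returned?

Answers:

1000 rows

0 rows

Reason:

EXISTS only asks whether a row is found and returns true as soon as one is. In the MySQL behavior described here, the LIMIT inside the subquery is simply ignored, or is never reached.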

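Question 2:

Can EXISTS completely replace IN?

No.

For example:

-- No correlated column: an enumerated list of constants

select * from areas where id in (4, 5, 6);

-- No correlated column: written with EXISTS, the subquery would be true for every outer row or false for every outer row

select * from areas where id in (select city_id from deals where deals.name = 'xxx');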
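
The all-or-nothing behavior of an uncorrelated EXISTS is easy to observe. Here is a small sketch, assuming SQLite via Python's sqlite3 module in place of MySQL and a few invented rows; the deal name 'zzz' is a made-up value that matches nothing.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE areas (id INTEGER PRIMARY KEY);
    CREATE TABLE deals (city_id INTEGER, name TEXT);
    INSERT INTO areas VALUES (1), (2), (3);
    INSERT INTO deals VALUES (2, 'xxx'), (3, 'yyy');
""")


def ids(sql):
    """Run a one-column query and return its values as a list."""
    return [row[0] for row in conn.execute(sql)]


# IN compares each areas.id with the subquery's values.
in_ids = ids(
    "SELECT id FROM areas WHERE id IN "
    "(SELECT city_id FROM deals WHERE deals.name = 'xxx') ORDER BY id")

# Uncorrelated EXISTS: one truth value shared by every outer row.
exists_hit = ids(
    "SELECT id FROM areas WHERE EXISTS "
    "(SELECT 1 FROM deals WHERE deals.name = 'xxx') ORDER BY id")
exists_miss = ids(
    "SELECT id FROM areas WHERE EXISTS "
    "(SELECT 1 FROM deals WHERE deals.name = 'zzz') ORDER BY id")

print(in_ids, exists_hit, exists_miss)
```

IN picks out area 2 only, while the uncorrelated EXISTS returns either every area or none.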

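An example of SQL optimization involving EXISTS:

9. Replace IN with EXISTS (many programmers turn out not to know how to use this):
In many queries against a base table, satisfying a condition often requires joining another table.
In such cases, using EXISTS (or NOT EXISTS) will usually make the query more efficient.
Example:
(inefficient)
select ... from table1 t1 where t1.id > 10 and pno in (select no from table2 where name like 'www%');
(efficient)
select ... from table1 t1 where t1.id > 10 and exists (select 1 from table2 t2 where t1.pno = t2.no and name like 'www%');
10. Replace NOT IN with NOT EXISTS:
In a subquery, a NOT IN clause performs an internal sort and merge.
Either way, NOT IN is the least efficient option (because it does a full scan of the table in the subquery).
To avoid NOT IN, rewrite it as an outer join or as NOT EXISTS.
11. Replace DISTINCT with EXISTS:
When a query covers one-to-many table information, avoid DISTINCT in the select clause; EXISTS is usually a good replacement.
Example:
(inefficient)
select distinct d.dept_no, d.dept_name from t_dept d, t_emp e where d.dept_no = e.dept_no;
(efficient)
select d.dept_no, d.dept_name from t_dept d where exists (select 1 from t_emp e where d.dept_no = e.dept_no);
EXISTS makes the query faster because the RDBMS core returns as soon as the subquery's condition is satisfied.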
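
To illustrate why DISTINCT is needed in the join form and not in the EXISTS form, here is a minimal sketch on invented data, using SQLite through Python's sqlite3 module as an assumed stand-in. The emp_no column and the sample rows are hypothetical, and the sketch checks results only, not speed.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t_dept (dept_no INTEGER PRIMARY KEY, dept_name TEXT);
    CREATE TABLE t_emp (emp_no INTEGER PRIMARY KEY, dept_no INTEGER);
    INSERT INTO t_dept VALUES (10, 'Sales'), (20, 'Ops'), (30, 'Empty');
    INSERT INTO t_emp VALUES (1, 10), (2, 10), (3, 20);
""")

# Plain join: one output row per employee, so departments repeat.
plain_join = conn.execute(
    "SELECT d.dept_no, d.dept_name FROM t_dept d, t_emp e "
    "WHERE d.dept_no = e.dept_no ORDER BY d.dept_no"
).fetchall()

# DISTINCT removes the repeats after the join has produced them.
distinct_join = conn.execute(
    "SELECT DISTINCT d.dept_no, d.dept_name FROM t_dept d, t_emp e "
    "WHERE d.dept_no = e.dept_no ORDER BY d.dept_no"
).fetchall()

# EXISTS emits each department at most once, so no DISTINCT is needed.
exists_rows = conn.execute(
    "SELECT d.dept_no, d.dept_name FROM t_dept d "
    "WHERE EXISTS (SELECT 1 FROM t_emp e WHERE d.dept_no = e.dept_no) "
    "ORDER BY d.dept_no"
).fetchall()

print(plain_join, distinct_join, exists_rows)
```

Sales appears twice in the plain join because it has two employees; the DISTINCT and EXISTS forms both list each staffed department once.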
12. Replace EXISTS with a table join:
Generally, a table join is more efficient than EXISTS.
Example:
(inefficient)
select ename from emp e where exists (select 1 from dept where dept_no = e.dept_no and dept_cat = 'W');
(efficient)
select ename from dept d, emp e where e.dept_no = d.dept_no and dept_cat = 'W';
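
One caveat worth checking before swapping EXISTS for a join: the two only return the same rows when dept_no is unique in dept. The sketch below, on invented data in SQLite via Python's sqlite3 module (an assumed stand-in for the target database), deliberately leaves dept_no without a unique constraint to show the difference.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dept (dept_no INTEGER, dept_cat TEXT);
    CREATE TABLE emp (ename TEXT, dept_no INTEGER);
    INSERT INTO dept VALUES (1, 'W'), (2, 'X');
    INSERT INTO emp VALUES ('ann', 1), ('bob', 2), ('cid', 1);
""")

EXISTS_SQL = (
    "SELECT ename FROM emp e WHERE EXISTS "
    "(SELECT 1 FROM dept WHERE dept.dept_no = e.dept_no AND dept_cat = 'W') "
    "ORDER BY ename")
JOIN_SQL = (
    "SELECT ename FROM dept d, emp e "
    "WHERE e.dept_no = d.dept_no AND dept_cat = 'W' ORDER BY ename")


def names(sql):
    """Run a one-column query and return its values as a list."""
    return [row[0] for row in conn.execute(sql)]


# With unique dept_no values, both forms agree.
exists_before = names(EXISTS_SQL)
join_before = names(JOIN_SQL)

# A duplicate department row multiplies the join output but not EXISTS.
conn.execute("INSERT INTO dept VALUES (1, 'W')")
exists_after = names(EXISTS_SQL)
join_after = names(JOIN_SQL)

print(exists_before, join_before, exists_after, join_after)
```

When dept_no is a primary key, as it normally would be, the duplicate case cannot arise and the join form is a safe rewrite.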

2014-09-17 19:59