oracle行预取（raw prefecting）和聚簇因子（clustering

本文主要是介绍oracle行预取（raw prefecting）和聚簇因子（clustering_factor），希望对大家解决编程问题提供一定的参考价值，需要的开发者们随着小编来一起学习吧！

oracle行预取（raw prefecting）和聚簇因子（clustering_factor）

转自：行预取（raw prefecting）和聚簇因子（clustering_factor）

背景介绍

行预取：

每次应用程序请求驱动从数据库返回1条记录的时候，会预取多条记录并将它们存储在客户端的内存中。这样，多个连续的请求就不需要执行数据库的调用来读取数据。可以直接从客户端内存中得到他们。结果，到数据库的往返次数随预取记录数量的增加呈比例的降低。因此，检索包含大量记录的结果集的开销会显著的降低；
Oracle数据库引擎只通过一次逻辑读就可以同时获取多行数据，以提高性能。一次行预取读取的行数由arraysize指定。

聚簇因子

聚簇因子表明索引中多少相邻的索引键值不指向表中相同的数据块，简单来说，聚簇因子高（即接近于表行数），表示索引键值顺序和行在数据块中的存储顺序很不一样，行预取的作用就不明显；聚簇因子低（即接近于表数据块个数），表示索引键值顺序和行在数据块中的存储顺序很相似，行预取的作用就很明显。

实际检验

实验1

创建一个包含主键的测试表：

SQL>create table t (
2 id number,
3 pad varchar2(4000),
4 constraint t_pk primary key (id)
5 );

以id升序的顺序插入1000行数据：

SQL>insert into t
2 select rownum as id, dbms_random.string('p',500) as pad
3 from dual
4 connect by level <= 1000;

查看表占用了多少数据块：

SQL>analyze table T compute statistics;
SQL>select blocks,num_rows from user_tables where table_name='T';BLOCKS NUM_ROWS
---------- ----------
73 1000

查看索引的聚簇因子：

SQL>select clustering_factor from user_indexes where index_name='T_PK';CLUSTERING_FACTOR
-----------------
72

可以发现聚簇因子和表的数据块个数相近，说明聚簇因子很低，这种情况非常理想，行预取作用明显，可以有效地降低全索引扫描的逻辑读：

SQL>set autotrace traceonly
SQL>select /*+ index(t t_pk) */ * from t;
Execution Plan
----------------------------------------------------------
0 SELECT STATEMENT ptimizer=ALL_ROWS (Cost=75 Card=1000 Bytes=503000)
1 0 TABLE ACCESS (BY INDEX ROWID) OF 'T' (TABLE) (Cost=75 Card =1000 Bytes=503000)
2 1 INDEX (FULL SCAN) OF 'T_PK' (INDEX (UNIQUE)) (Cost=3 Card=1000)
Statistics
----------------------------------------------------------
0 recursive calls
0 db block gets
205 consistent gets
0 physical reads
0 redo size
512484 bytes sent via SQL*Net to client
741 bytes received via SQL*Net from client
68 SQL*Net roundtrips to/from client
0 sorts (memory)
0 sorts (disk)
1000 rows processed

consistent gets只有205

实验2

以id无序的顺序插入

SQL>truncate table t;
SQL>insert into t
2 select rownum as id, dbms_random.string('p',500) as pad
3 from dual
4 connect by level <=1000 order by dbms_random.value;

查看表占用了多少数据块：

SQL>analyze table T compute statistics;
SQL>select blocks,num_rows from user_tables where table_name='T';BLOCKS NUM_ROWS
---------- ----------
73 1000

查看索引的聚簇因子：

SQL>select clustering_factor from user_indexes where index_name='T_PK';CLUSTERING_FACTOR
-----------------
986

可以发现聚簇因子和表的数据行数相近，说明聚簇因子很高，这种情况很不理想，行预取几乎无法发挥作用，逻辑读很高：

SQL>set autotrace traceonly
SQL>select /*+ index(t t_pk) */ * from t;
Execution Plan
----------------------------------------------------------
0 SELECT STATEMENT ptimizer=ALL_ROWS (Cost=990 Card=1000 Bytes=503000)
1 0 TABLE ACCESS (BY INDEX ROWID) OF 'T' (TABLE) (Cost=990 Card=1000 Bytes=503000)
2 1 INDEX (FULL SCAN) OF 'T_PK' (INDEX (UNIQUE)) (Cost=3 Card=1000)Statistics
----------------------------------------------------------
1 recursive calls
0 db block gets
1056 consistent gets
0 physical reads
0 redo size
512482 bytes sent via SQL*Net to client
741 bytes received via SQL*Net from client
68 SQL*Net roundtrips to/from client
0 sorts (memory)
0 sorts (disk)
1000 rows processed

consistent gets达到了1056