- 浏览: 978862 次
- 性别:
- 来自: 杭州
文章分类
最新评论
-
孤星119:
好熟悉的数据库字段啊, 上家公司做的项目每天都跟这些字段打招呼 ...
Oracle exp compress参数引起的空间浪费 -
itspace:
quxiaoyong 写道遇到个问题,网上一搜,全他妈这篇文章 ...
数据库连接错误ORA-28547 -
quxiaoyong:
遇到个问题,网上一搜,全他妈这篇文章。你转来转去的有意思吗?
数据库连接错误ORA-28547 -
hctech:
关于version count过高的问题,不知博主是否看过ey ...
某客户数据库性能诊断报告 -
itspace:
invalid 写道写的不错,我根据这个来安装,有点理解错误了 ...
AIX 配置vncserver
首先来看一下故障发生的来龙去脉:
数据库版本:9.2.0.4
操作系统版本:5200-04
1、日志推进至95761,准备归档95760,未释放redo02.log
Fri Feb 25 04:22:00 2011
Thread 1 advanced to log sequence 95760
Current log# 2 seq# 95760 mem# 0: /zzrkdata/zzrk/redo02.log
。。。
Thread 1 advanced to log sequence 95761
Current log# 1 seq# 95761 mem# 0: /zzrkdata/zzrk/redo01.log
Fri Feb 25 04:23:11 2011
ARC0: Evaluating archive log 2 thread 1 sequence 95760
ARC0: Beginning to archive log 2 thread 1 sequence 95760
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_95760.arc'
2、ARC1不能归档95760,和ARC0进程发生争用。
Fri Feb 25 04:24:15 2011
ARC1: Completed archiving log 1 thread 1 sequence 95761
ARC1: Evaluating archive log 2 thread 1 sequence 95760
ARC1: Unable to archive log 2 thread 1 sequence 95760
Log actively being archived by another process
3、由于不能归档,导致rman备份失败,进而导致归档目录慢,业务挂起
Mon Mar 14 04:20:09 2011
Errors in file /oracle/app/admin/zzrk/bdump/zzrk_arc1_3846274.trc:
ORA-19502: write error on file "/zzrkarch/zzrk/arch_1_97795.arc", blockno 114689 (blocksize=512)
ORA-27063: skgfospo: number of bytes read/written is incorrect
IBM AIX RISC System/6000 Error: 28: No space left on device
4、日志准备推进至98186,准备归档98184,未释放redo04.log
Thu Mar 17 04:26:54 2011
ARCH: Connecting to console port...
Thread 1 cannot allocate new log, sequence 98185
All online logs needed archiving
Current log# 4 seq# 98184 mem# 0: /zzrkdata/zzrk/redo04.log
。。。
Thu Mar 17 04:27:09 2011
Thread 1 advanced to log sequence 98185
Current log# 1 seq# 98185 mem# 0: /zzrkdata/zzrk/redo01.log
Thu Mar 17 04:27:09 2011
ARC1: Evaluating archive log 2 thread 1 sequence 95760
ARC1: Unable to archive log 2 thread 1 sequence 95760
Log actively being archived by another process
ARC1: Evaluating archive log 4 thread 1 sequence 98184
ARC1: Beginning to archive log 4 thread 1 sequence 98184
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_98184.arc'
5、由于归档失败,98185占用redo01.log,未能推进至98186
Thu Mar 17 04:27:56 2011
ORACLE Instance zzrk - Can not allocate log, archival required
Thu Mar 17 04:27:56 2011
ARCH: Connecting to console port...
Thread 1 cannot allocate new log, sequence 98186
All online logs needed archiving
Current log# 1 seq# 98185 mem# 0: /zzrkdata/zzrk/redo01.log
Thu Mar 17 09:52:12 2011
ARCH: Evaluating archive log 2 thread 1 sequence 95760
ARCH: Unable to archive log 2 thread 1 sequence 95760
Log actively being archived by another process
ARCH: Evaluating archive log 4 thread 1 sequence 98184
ARCH: Unable to archive log 4 thread 1 sequence 98184
Log actively being archived by another process
Thu Mar 17 09:53:41 2011
Shutting down instance: further logons disabled
Shutting down instance (immediate)
License high water mark = 360
6、手动添加日志文件,推进至98186,并占用redo03.log
Thu Mar 17 09:57:12 2011
/* OracleOEM */ ALTER DATABASE ADD LOGFILE THREAD 1 GROUP 3 ('/zzrkdata/zzrk/redo03.log') SIZE 102400K REUSE;
Thu Mar 17 09:57:17 2011
Thread 1 advanced to log sequence 98186
Current log# 3 seq# 98186 mem# 0: /zzrkdata/zzrk/redo03.log
7、手动添加数据文件,推进至98187,并占用redo05.log,未能推进至98188
Thread 1 cannot allocate new log, sequence 98187
All online logs needed archiving
Current log# 3 seq# 98186 mem# 0: /zzrkdata/zzrk/redo03.log
Thu Mar 17 09:57:45 2011
/* OracleOEM */ ALTER DATABASE ADD LOGFILE THREAD 1 GROUP 5 ('/zzrkdata/zzrk/redo05.log') SIZE 102400K
Thu Mar 17 09:57:50 2011
Thread 1 advanced to log sequence 98187
Current log# 5 seq# 98187 mem# 0: /zzrkdata/zzrk/redo05.log
。。。
Thu Mar 17 09:58:19 2011
ARCH: Connecting to console port...
Thread 1 cannot allocate new log, sequence 98188
8、98185归档成功,释放redo01.log资源,推进至98188,占用redo01.log
Thu Mar 17 10:04:13 2011
ARCH: Evaluating archive log 2 thread 1 sequence 95760
ARCH: Unable to archive log 2 thread 1 sequence 95760
Log actively being archived by another process
ARCH: Evaluating archive log 4 thread 1 sequence 98184
ARCH: Unable to archive log 4 thread 1 sequence 98184
Log actively being archived by another process
ARCH: Evaluating archive log 1 thread 1 sequence 98185
ARCH: Beginning to archive log 1 thread 1 sequence 98185
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_98185.arc'
ARCH: Completed archiving log 1 thread 1 sequence 98185
Thu Mar 17 10:04:19 2011
Thread 1 advanced to log sequence 98188
Current log# 1 seq# 98188 mem# 0: /zzrkdata/zzrk/redo01.log
9、98186,98187归档成功,释放redo03.log,redo05.log
Thu Mar 17 10:04:28 2011
ARCH: Completed archiving log 3 thread 1 sequence 98186
ARCH: Evaluating archive log 2 thread 1 sequence 95760
ARCH: Unable to archive log 2 thread 1 sequence 95760
Log actively being archived by another process
ARCH: Evaluating archive log 4 thread 1 sequence 98184
ARCH: Unable to archive log 4 thread 1 sequence 98184
Log actively being archived by another process
ARCH: Evaluating archive log 5 thread 1 sequence 98187
ARCH: Beginning to archive log 5 thread 1 sequence 98187
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_98187.arc'
ARCH: Completed archiving log 5 thread 1 sequence 98187
9、推进至98190。98189占用redo03.log,98190占用redo05.log。未能推进至98191
Thu Mar 17 10:04:38 2011
Thread 1 advanced to log sequence 98189
Current log# 3 seq# 98189 mem# 0: /zzrkdata/zzrk/redo03.log
Thu Mar 17 10:05:38 2011
ORACLE Instance zzrk - Can not allocate log, archival required
Thu Mar 17 10:05:38 2011
ARCH: Connecting to console port...
Thread 1 cannot allocate new log, sequence 98190
All online logs needed archiving
Current log# 3 seq# 98189 mem# 0: /zzrkdata/zzrk/redo03.log
Thu Mar 17 10:06:42 2011
Thread 1 advanced to log sequence 98190
Current log# 5 seq# 98190 mem# 0: /zzrkdata/zzrk/redo05.log
Thu Mar 17 10:07:10 2011
ORACLE Instance zzrk - Can not allocate log, archival required
Thu Mar 17 10:07:10 2011
ARCH: Connecting to console port...
Thread 1 cannot allocate new log, sequence 98191
All online logs needed archiving
Current log# 5 seq# 98190 mem# 0: /zzrkdata/zzrk/redo05.log
10、手动添加日志文件失败,
Thu Mar 17 10:08:14 2011
/* OracleOEM */ ALTER DATABASE ADD LOGFILE THREAD 1 GROUP 6 ('/zzrkdata/zzrk/redo06.log') SIZE 102400K
Thu Mar 17 10:08:14 2011
ORA-1185 signalled during: /* OracleOEM */ ALTER DATABASE ADD LOGFILE THREAD ...
默认情况下MAXLOGMEMBERS为5
CREATE CONTROLFILE REUSE DATABASE "ORA9I" NORESETLOGS NOARCHIVELOG
-- SET STANDBY TO MAXIMIZE PERFORMANCE
MAXLOGFILES 50
MAXLOGMEMBERS 5
MAXDATAFILES 100
MAXINSTANCES 1
MAXLOGHISTORY 2722
ora10g@localhost ~]$ oerr ora 1185
01185, 00000, "logfile group number %s is invalid"
// *Cause: An ALTER DATABASE ADD LOGFILE command specified a log number for
// the new log which is too large.
// *Action: Specify a correct logfile number.
11、尝试关闭数据库,失败。采用手工杀进程
Shutting down instance (abort)
License high water mark = 360
USER: terminating instance due to error 1092
Thu Mar 17 10:30:14 2011
USER: terminating instance due to error 1092
Thu Mar 17 10:30:14 2011
DIAG: terminating instance due to error 1092
Thu Mar 17 10:30:14 2011
LMD0: terminating instance due to error 1092
Thu Mar 17 10:30:14 2011
USER: terminating instance due to error 1092
Thu Mar 17 10:30:16 2011
USER: terminating instance due to error 1092
Thu Mar 17 10:30:16 2011
PMON: terminating instance due to error 1092
Thu Mar 17 10:30:33 2011
Instance terminated by DIAG, pid = 454866
[ora10g@localhost ~]$ oerr ora 1092
01092, 00000, "ORACLE instance terminated. Disconnection forced"
// *Cause: The instance this process was connected to was terminated
// abnormally, probably via a shutdown abort. This process
// was forced to disconnect from the instance.
// *Action: Examine the alert log for more details. When the instance has been
// restarted, retry action.
12、重启Oracle,数据库完成crash recover,但open阶段hang,如果事务恢复的话,磁盘会有相应活动信息,但通过iostat观察,磁盘没有任何io信息,于是决定数据库再次abort
Thu Mar 17 10:40:56 2011
Starting ORACLE instance (normal)
Thu Mar 17 10:40:56 2011
Global Enqueue Service Resources = 64, pool = 8
Thu Mar 17 10:40:56 2011
Global Enqueue Service Enqueues = 128
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
GES IPC: Receivers 1 Senders 1
GES IPC: Buffers Receive 1000 Send 1430 Reserve 1000
GES IPC: Msg Size Regular 440 Batch 2048
SCN scheme 3
Using log_archive_dest parameter default value
LICENSE_MAX_USERS = 0
SYS auditing is disabled
Starting up ORACLE RDBMS Version: 9.2.0.4.0.
System parameters with non-default values:
processes = 600
timed_statistics = TRUE
shared_pool_size = 838860800
large_pool_size = 167772160
java_pool_size = 33554432
control_files = /zzrkdata/zzrk/control01.ctl
db_block_size = 16384
db_cache_size = 4009754624
compatible = 9.2.0.0.0
log_archive_start = TRUE
log_archive_dest_1 = location=/zzrkarch/zzrk
log_archive_format = arch_%t_%s.arc
db_file_multiblock_read_count= 32
fast_start_mttr_target = 300
undo_management = AUTO
undo_tablespace = UNDOTBS1
undo_retention = 10800
remote_login_passwordfile= EXCLUSIVE
db_domain =
instance_name = zzrk
hash_join_enabled = TRUE
background_dump_dest = /oracle/app/admin/zzrk/bdump
user_dump_dest = /oracle/app/admin/zzrk/udump
core_dump_dest = /oracle/app/admin/zzrk/cdump
sort_area_size = 10485760
db_name = zzrk
open_cursors = 1500
star_transformation_enabled= FALSE
query_rewrite_enabled = FALSE
pga_aggregate_target = 1024000000
Thu Mar 17 10:40:56 2011
cluster interconnect IPC version:Oracle UDP/IP
IPC Vendor 1 proto 2 Version 1.0
PMON started with pid=2
DIAG started with pid=3
LMON started with pid=4
LMD0 started with pid=5
DBW0 started with pid=6
LGWR started with pid=7
CKPT started with pid=8
SMON started with pid=9
RECO started with pid=10
Thu Mar 17 10:41:00 2011
ARCH: STARTING ARCH PROCESSES
ARC0 started with pid=11
ARC0: Archival started
ARC1 started with pid=12
ARC1: Archival started
Thu Mar 17 10:41:00 2011
ARCH: STARTING ARCH PROCESSES COMPLETE
Thu Mar 17 10:41:00 2011
ARC1: Thread not mounted
Thu Mar 17 10:41:00 2011
ARC0: Thread not mounted
Thu Mar 17 10:41:00 2011
ALTER DATABASE MOUNT
Thu Mar 17 10:41:01 2011
lmon registered with NM - instance id 1 (internal mem no 0)
Thu Mar 17 10:41:01 2011
Reconfiguration started
List of nodes: 0,
Global Resource Directory frozen
one node partition
Communication channels reestablished
Master broadcasted resource hash value bitmaps
Non-local Process blocks cleaned out
Resources and enqueues cleaned out
Resources remastered 0
set master node info
Submitted all remote-enqueue requests
Update rdomain variables
Dwn-cvts replayed, VALBLKs dubious
All grantable enqueues granted
0 GCS shadows traversed, 0 replayed, 0 unopened
Submitted all GCS remote-cache requests
Thu Mar 17 10:41:04 2011
Reconfiguration complete
Post SMON to start 1st pass IR
Thu Mar 17 10:41:08 2011
Successful mount of redo thread 1, with mount id 1306399872.
Thu Mar 17 10:41:08 2011
Database mounted in Exclusive Mode.
Completed: ALTER DATABASE MOUNT
Thu Mar 17 10:41:08 2011
ALTER DATABASE OPEN
Thu Mar 17 10:41:08 2011
Beginning crash recovery of 1 threads
Thu Mar 17 10:41:08 2011
Started first pass scan
Thu Mar 17 10:41:09 2011
Completed first pass scan
184006 redo blocks read, 7257 data blocks need recovery
Thu Mar 17 10:41:10 2011
Started recovery at
Thread 1: logseq 98190, block 20793, scn 0.0
Recovery of Online Redo Log: Thread 1 Group 5 Seq 98190 Reading mem 0
Mem# 0 errs 0: /zzrkdata/zzrk/redo05.log
Thu Mar 17 10:41:17 2011
Completed redo application
Thu Mar 17 10:41:51 2011
Ended recovery at
Thread 1: logseq 98190, block 204799, scn 2825.638644552
7257 data blocks read, 5528 data blocks written, 184006 redo blocks read
Crash recovery completed successfully
Thu Mar 17 10:41:52 2011
LGWR: Primary database is in CLUSTER CONSISTENT mode
Thu Mar 17 11:08:38 2011
操作系统查看ls -l /zzrkdata/zzrk/redo05.log出现hang
在数据库查看状态为
SQL> select * from v$log;
GROUP# THREAD# SEQUENCE# BYTES MEMBERS ARCHIVED
---------- ---------- ---------- ---------- ---------- ---------
STATUS FIRST_CHANGE#
------------------------------------------------ -------------
FIRST_TIME
------------------
1 1 98188 104857600 1 NO
INACTIVE 1.2134E+13
17-MAR-11
2 1 95760 104857600 1 NO
INACTIVE 1.2117E+13
25-FEB-11
GROUP# THREAD# SEQUENCE# BYTES MEMBERS ARCHIVED
---------- ---------- ---------- ---------- ---------- ---------
STATUS FIRST_CHANGE#
------------------------------------------------ -------------
FIRST_TIME
------------------
3 1 98189 104857600 1 NO
INACTIVE 1.2134E+13
17-MAR-11
4 1 98184 104857600 1 NO
INACTIVE 1.2134E+13
GROUP# THREAD# SEQUENCE# BYTES MEMBERS ARCHIVED
---------- ---------- ---------- ---------- ---------- ---------
STATUS FIRST_CHANGE#
------------------------------------------------ -------------
FIRST_TIME
------------------
17-MAR-11
5 1 98190 104857600 1 NO
INVALIDATED 1.2134E+13
17-MAR-11
13、郁闷的,只有重启主机,经过漫长的等待。。。。查看ls -l redo*是否正常。
再次启动数据库至mount状态,为保险起见,采用recover database;但是报如下错,这是因为上次启动时已经完成crash recover。
Thu Mar 17 12:34:47 2011
Media Recovery Start
Media Recovery failed with error 264
ORA-283 signalled during: ALTER DATABASE RECOVER database ...
[ora10g@localhost ~]$ oerr ora 264
00264, 00000, "no recovery required"
// *Cause: An attempt was made to perform media recovery on files that do not // need any type of recovery.
// *Action: Do not attempt to perform media recovery on the selected
// files. Check to see that the filenames were entered properly.
// If not, retry the command with the proper filenames.
14、接下来就是open数据库,进行tx recover
Thu Mar 17 12:35:26 2011
alter database open
Thu Mar 17 12:35:26 2011
LGWR: Primary database is in CLUSTER CONSISTENT mode
Thu Mar 17 12:35:26 2011
ARCH: Evaluating archive log 2 thread 1 sequence 95760
ARCH: Beginning to archive log 2 thread 1 sequence 95760
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_95760.arc'
ARCH: Completed archiving log 2 thread 1 sequence 95760
Thu Mar 17 12:35:31 2011
LGWR: Primary database is in CLUSTER CONSISTENT mode
Thread 1 advanced to log sequence 98191
Thread 1 opened at log sequence 98191
Current log# 2 seq# 98191 mem# 0: /zzrkdata/zzrk/redo02.log
Successful open of redo thread 1.
Thu Mar 17 12:35:32 2011
SMON: enabling cache recovery
Thu Mar 17 12:35:32 2011
ARC0: Evaluating archive log 4 thread 1 sequence 98184
ARC0: Beginning to archive log 4 thread 1 sequence 98184
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_98184.arc'
Thu Mar 17 12:35:33 2011
Undo Segment 1 Onlined
Undo Segment 2 Onlined
Undo Segment 3 Onlined
Undo Segment 4 Onlined
Undo Segment 5 Onlined
Undo Segment 6 Onlined
Undo Segment 7 Onlined
Undo Segment 8 Onlined
Undo Segment 9 Onlined
Undo Segment 10 Onlined
Successfully onlined Undo Tablespace 1.
Thu Mar 17 12:35:33 2011
SMON: enabling tx recovery
Thu Mar 17 12:35:33 2011
Database Characterset is ZHS16GBK
Thu Mar 17 12:35:34 2011
SMON: about to recover undo segment 16
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
Thu Mar 17 12:35:34 2011
replication_dependency_tracking turned off (no async multimaster replication found)
Thu Mar 17 12:35:34 2011
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
Thu Mar 17 12:35:35 2011
Completed: alter database open
Thu Mar 17 12:35:35 2011
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
。。。
SMON: Parallel transaction recovery tried
。。。
故障是解决了,但是郁闷的是最终通过重启主机来解决的。
数据库版本:9.2.0.4
操作系统版本:5200-04
1、日志推进至95761,准备归档95760,未释放redo02.log
Fri Feb 25 04:22:00 2011
Thread 1 advanced to log sequence 95760
Current log# 2 seq# 95760 mem# 0: /zzrkdata/zzrk/redo02.log
。。。
Thread 1 advanced to log sequence 95761
Current log# 1 seq# 95761 mem# 0: /zzrkdata/zzrk/redo01.log
Fri Feb 25 04:23:11 2011
ARC0: Evaluating archive log 2 thread 1 sequence 95760
ARC0: Beginning to archive log 2 thread 1 sequence 95760
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_95760.arc'
2、ARC1不能归档95760,和ARC0进程发生争用。
Fri Feb 25 04:24:15 2011
ARC1: Completed archiving log 1 thread 1 sequence 95761
ARC1: Evaluating archive log 2 thread 1 sequence 95760
ARC1: Unable to archive log 2 thread 1 sequence 95760
Log actively being archived by another process
3、由于不能归档,导致rman备份失败,进而导致归档目录慢,业务挂起
Mon Mar 14 04:20:09 2011
Errors in file /oracle/app/admin/zzrk/bdump/zzrk_arc1_3846274.trc:
ORA-19502: write error on file "/zzrkarch/zzrk/arch_1_97795.arc", blockno 114689 (blocksize=512)
ORA-27063: skgfospo: number of bytes read/written is incorrect
IBM AIX RISC System/6000 Error: 28: No space left on device
4、日志准备推进至98186,准备归档98184,未释放redo04.log
Thu Mar 17 04:26:54 2011
ARCH: Connecting to console port...
Thread 1 cannot allocate new log, sequence 98185
All online logs needed archiving
Current log# 4 seq# 98184 mem# 0: /zzrkdata/zzrk/redo04.log
。。。
Thu Mar 17 04:27:09 2011
Thread 1 advanced to log sequence 98185
Current log# 1 seq# 98185 mem# 0: /zzrkdata/zzrk/redo01.log
Thu Mar 17 04:27:09 2011
ARC1: Evaluating archive log 2 thread 1 sequence 95760
ARC1: Unable to archive log 2 thread 1 sequence 95760
Log actively being archived by another process
ARC1: Evaluating archive log 4 thread 1 sequence 98184
ARC1: Beginning to archive log 4 thread 1 sequence 98184
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_98184.arc'
5、由于归档失败,98185占用redo01.log,未能推进至98186
Thu Mar 17 04:27:56 2011
ORACLE Instance zzrk - Can not allocate log, archival required
Thu Mar 17 04:27:56 2011
ARCH: Connecting to console port...
Thread 1 cannot allocate new log, sequence 98186
All online logs needed archiving
Current log# 1 seq# 98185 mem# 0: /zzrkdata/zzrk/redo01.log
Thu Mar 17 09:52:12 2011
ARCH: Evaluating archive log 2 thread 1 sequence 95760
ARCH: Unable to archive log 2 thread 1 sequence 95760
Log actively being archived by another process
ARCH: Evaluating archive log 4 thread 1 sequence 98184
ARCH: Unable to archive log 4 thread 1 sequence 98184
Log actively being archived by another process
Thu Mar 17 09:53:41 2011
Shutting down instance: further logons disabled
Shutting down instance (immediate)
License high water mark = 360
6、手动添加日志文件,推进至98186,并占用redo03.log
Thu Mar 17 09:57:12 2011
/* OracleOEM */ ALTER DATABASE ADD LOGFILE THREAD 1 GROUP 3 ('/zzrkdata/zzrk/redo03.log') SIZE 102400K REUSE;
Thu Mar 17 09:57:17 2011
Thread 1 advanced to log sequence 98186
Current log# 3 seq# 98186 mem# 0: /zzrkdata/zzrk/redo03.log
7、手动添加数据文件,推进至98187,并占用redo05.log,未能推进至98188
Thread 1 cannot allocate new log, sequence 98187
All online logs needed archiving
Current log# 3 seq# 98186 mem# 0: /zzrkdata/zzrk/redo03.log
Thu Mar 17 09:57:45 2011
/* OracleOEM */ ALTER DATABASE ADD LOGFILE THREAD 1 GROUP 5 ('/zzrkdata/zzrk/redo05.log') SIZE 102400K
Thu Mar 17 09:57:50 2011
Thread 1 advanced to log sequence 98187
Current log# 5 seq# 98187 mem# 0: /zzrkdata/zzrk/redo05.log
。。。
Thu Mar 17 09:58:19 2011
ARCH: Connecting to console port...
Thread 1 cannot allocate new log, sequence 98188
8、98185归档成功,释放redo01.log资源,推进至98188,占用redo01.log
Thu Mar 17 10:04:13 2011
ARCH: Evaluating archive log 2 thread 1 sequence 95760
ARCH: Unable to archive log 2 thread 1 sequence 95760
Log actively being archived by another process
ARCH: Evaluating archive log 4 thread 1 sequence 98184
ARCH: Unable to archive log 4 thread 1 sequence 98184
Log actively being archived by another process
ARCH: Evaluating archive log 1 thread 1 sequence 98185
ARCH: Beginning to archive log 1 thread 1 sequence 98185
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_98185.arc'
ARCH: Completed archiving log 1 thread 1 sequence 98185
Thu Mar 17 10:04:19 2011
Thread 1 advanced to log sequence 98188
Current log# 1 seq# 98188 mem# 0: /zzrkdata/zzrk/redo01.log
9、98186,98187归档成功,释放redo03.log,redo05.log
Thu Mar 17 10:04:28 2011
ARCH: Completed archiving log 3 thread 1 sequence 98186
ARCH: Evaluating archive log 2 thread 1 sequence 95760
ARCH: Unable to archive log 2 thread 1 sequence 95760
Log actively being archived by another process
ARCH: Evaluating archive log 4 thread 1 sequence 98184
ARCH: Unable to archive log 4 thread 1 sequence 98184
Log actively being archived by another process
ARCH: Evaluating archive log 5 thread 1 sequence 98187
ARCH: Beginning to archive log 5 thread 1 sequence 98187
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_98187.arc'
ARCH: Completed archiving log 5 thread 1 sequence 98187
9、推进至98190。98189占用redo03.log,98190占用redo05.log。未能推进至98191
Thu Mar 17 10:04:38 2011
Thread 1 advanced to log sequence 98189
Current log# 3 seq# 98189 mem# 0: /zzrkdata/zzrk/redo03.log
Thu Mar 17 10:05:38 2011
ORACLE Instance zzrk - Can not allocate log, archival required
Thu Mar 17 10:05:38 2011
ARCH: Connecting to console port...
Thread 1 cannot allocate new log, sequence 98190
All online logs needed archiving
Current log# 3 seq# 98189 mem# 0: /zzrkdata/zzrk/redo03.log
Thu Mar 17 10:06:42 2011
Thread 1 advanced to log sequence 98190
Current log# 5 seq# 98190 mem# 0: /zzrkdata/zzrk/redo05.log
Thu Mar 17 10:07:10 2011
ORACLE Instance zzrk - Can not allocate log, archival required
Thu Mar 17 10:07:10 2011
ARCH: Connecting to console port...
Thread 1 cannot allocate new log, sequence 98191
All online logs needed archiving
Current log# 5 seq# 98190 mem# 0: /zzrkdata/zzrk/redo05.log
10、手动添加日志文件失败,
Thu Mar 17 10:08:14 2011
/* OracleOEM */ ALTER DATABASE ADD LOGFILE THREAD 1 GROUP 6 ('/zzrkdata/zzrk/redo06.log') SIZE 102400K
Thu Mar 17 10:08:14 2011
ORA-1185 signalled during: /* OracleOEM */ ALTER DATABASE ADD LOGFILE THREAD ...
默认情况下MAXLOGMEMBERS为5
CREATE CONTROLFILE REUSE DATABASE "ORA9I" NORESETLOGS NOARCHIVELOG
-- SET STANDBY TO MAXIMIZE PERFORMANCE
MAXLOGFILES 50
MAXLOGMEMBERS 5
MAXDATAFILES 100
MAXINSTANCES 1
MAXLOGHISTORY 2722
ora10g@localhost ~]$ oerr ora 1185
01185, 00000, "logfile group number %s is invalid"
// *Cause: An ALTER DATABASE ADD LOGFILE command specified a log number for
// the new log which is too large.
// *Action: Specify a correct logfile number.
11、尝试关闭数据库,失败。采用手工杀进程
Shutting down instance (abort)
License high water mark = 360
USER: terminating instance due to error 1092
Thu Mar 17 10:30:14 2011
USER: terminating instance due to error 1092
Thu Mar 17 10:30:14 2011
DIAG: terminating instance due to error 1092
Thu Mar 17 10:30:14 2011
LMD0: terminating instance due to error 1092
Thu Mar 17 10:30:14 2011
USER: terminating instance due to error 1092
Thu Mar 17 10:30:16 2011
USER: terminating instance due to error 1092
Thu Mar 17 10:30:16 2011
PMON: terminating instance due to error 1092
Thu Mar 17 10:30:33 2011
Instance terminated by DIAG, pid = 454866
[ora10g@localhost ~]$ oerr ora 1092
01092, 00000, "ORACLE instance terminated. Disconnection forced"
// *Cause: The instance this process was connected to was terminated
// abnormally, probably via a shutdown abort. This process
// was forced to disconnect from the instance.
// *Action: Examine the alert log for more details. When the instance has been
// restarted, retry action.
12、重启Oracle,数据库完成crash recover,但open阶段hang,如果事务恢复的话,磁盘会有相应活动信息,但通过iostat观察,磁盘没有任何io信息,于是决定数据库再次abort
Thu Mar 17 10:40:56 2011
Starting ORACLE instance (normal)
Thu Mar 17 10:40:56 2011
Global Enqueue Service Resources = 64, pool = 8
Thu Mar 17 10:40:56 2011
Global Enqueue Service Enqueues = 128
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
GES IPC: Receivers 1 Senders 1
GES IPC: Buffers Receive 1000 Send 1430 Reserve 1000
GES IPC: Msg Size Regular 440 Batch 2048
SCN scheme 3
Using log_archive_dest parameter default value
LICENSE_MAX_USERS = 0
SYS auditing is disabled
Starting up ORACLE RDBMS Version: 9.2.0.4.0.
System parameters with non-default values:
processes = 600
timed_statistics = TRUE
shared_pool_size = 838860800
large_pool_size = 167772160
java_pool_size = 33554432
control_files = /zzrkdata/zzrk/control01.ctl
db_block_size = 16384
db_cache_size = 4009754624
compatible = 9.2.0.0.0
log_archive_start = TRUE
log_archive_dest_1 = location=/zzrkarch/zzrk
log_archive_format = arch_%t_%s.arc
db_file_multiblock_read_count= 32
fast_start_mttr_target = 300
undo_management = AUTO
undo_tablespace = UNDOTBS1
undo_retention = 10800
remote_login_passwordfile= EXCLUSIVE
db_domain =
instance_name = zzrk
hash_join_enabled = TRUE
background_dump_dest = /oracle/app/admin/zzrk/bdump
user_dump_dest = /oracle/app/admin/zzrk/udump
core_dump_dest = /oracle/app/admin/zzrk/cdump
sort_area_size = 10485760
db_name = zzrk
open_cursors = 1500
star_transformation_enabled= FALSE
query_rewrite_enabled = FALSE
pga_aggregate_target = 1024000000
Thu Mar 17 10:40:56 2011
cluster interconnect IPC version:Oracle UDP/IP
IPC Vendor 1 proto 2 Version 1.0
PMON started with pid=2
DIAG started with pid=3
LMON started with pid=4
LMD0 started with pid=5
DBW0 started with pid=6
LGWR started with pid=7
CKPT started with pid=8
SMON started with pid=9
RECO started with pid=10
Thu Mar 17 10:41:00 2011
ARCH: STARTING ARCH PROCESSES
ARC0 started with pid=11
ARC0: Archival started
ARC1 started with pid=12
ARC1: Archival started
Thu Mar 17 10:41:00 2011
ARCH: STARTING ARCH PROCESSES COMPLETE
Thu Mar 17 10:41:00 2011
ARC1: Thread not mounted
Thu Mar 17 10:41:00 2011
ARC0: Thread not mounted
Thu Mar 17 10:41:00 2011
ALTER DATABASE MOUNT
Thu Mar 17 10:41:01 2011
lmon registered with NM - instance id 1 (internal mem no 0)
Thu Mar 17 10:41:01 2011
Reconfiguration started
List of nodes: 0,
Global Resource Directory frozen
one node partition
Communication channels reestablished
Master broadcasted resource hash value bitmaps
Non-local Process blocks cleaned out
Resources and enqueues cleaned out
Resources remastered 0
set master node info
Submitted all remote-enqueue requests
Update rdomain variables
Dwn-cvts replayed, VALBLKs dubious
All grantable enqueues granted
0 GCS shadows traversed, 0 replayed, 0 unopened
Submitted all GCS remote-cache requests
Thu Mar 17 10:41:04 2011
Reconfiguration complete
Post SMON to start 1st pass IR
Thu Mar 17 10:41:08 2011
Successful mount of redo thread 1, with mount id 1306399872.
Thu Mar 17 10:41:08 2011
Database mounted in Exclusive Mode.
Completed: ALTER DATABASE MOUNT
Thu Mar 17 10:41:08 2011
ALTER DATABASE OPEN
Thu Mar 17 10:41:08 2011
Beginning crash recovery of 1 threads
Thu Mar 17 10:41:08 2011
Started first pass scan
Thu Mar 17 10:41:09 2011
Completed first pass scan
184006 redo blocks read, 7257 data blocks need recovery
Thu Mar 17 10:41:10 2011
Started recovery at
Thread 1: logseq 98190, block 20793, scn 0.0
Recovery of Online Redo Log: Thread 1 Group 5 Seq 98190 Reading mem 0
Mem# 0 errs 0: /zzrkdata/zzrk/redo05.log
Thu Mar 17 10:41:17 2011
Completed redo application
Thu Mar 17 10:41:51 2011
Ended recovery at
Thread 1: logseq 98190, block 204799, scn 2825.638644552
7257 data blocks read, 5528 data blocks written, 184006 redo blocks read
Crash recovery completed successfully
Thu Mar 17 10:41:52 2011
LGWR: Primary database is in CLUSTER CONSISTENT mode
Thu Mar 17 11:08:38 2011
操作系统查看ls -l /zzrkdata/zzrk/redo05.log出现hang
在数据库查看状态为
SQL> select * from v$log;
GROUP# THREAD# SEQUENCE# BYTES MEMBERS ARCHIVED
---------- ---------- ---------- ---------- ---------- ---------
STATUS FIRST_CHANGE#
------------------------------------------------ -------------
FIRST_TIME
------------------
1 1 98188 104857600 1 NO
INACTIVE 1.2134E+13
17-MAR-11
2 1 95760 104857600 1 NO
INACTIVE 1.2117E+13
25-FEB-11
GROUP# THREAD# SEQUENCE# BYTES MEMBERS ARCHIVED
---------- ---------- ---------- ---------- ---------- ---------
STATUS FIRST_CHANGE#
------------------------------------------------ -------------
FIRST_TIME
------------------
3 1 98189 104857600 1 NO
INACTIVE 1.2134E+13
17-MAR-11
4 1 98184 104857600 1 NO
INACTIVE 1.2134E+13
GROUP# THREAD# SEQUENCE# BYTES MEMBERS ARCHIVED
---------- ---------- ---------- ---------- ---------- ---------
STATUS FIRST_CHANGE#
------------------------------------------------ -------------
FIRST_TIME
------------------
17-MAR-11
5 1 98190 104857600 1 NO
INVALIDATED 1.2134E+13
17-MAR-11
13、郁闷的,只有重启主机,经过漫长的等待。。。。查看ls -l redo*是否正常。
再次启动数据库至mount状态,为保险起见,采用recover database;但是报如下错,这是因为上次启动时已经完成crash recover。
Thu Mar 17 12:34:47 2011
Media Recovery Start
Media Recovery failed with error 264
ORA-283 signalled during: ALTER DATABASE RECOVER database ...
[ora10g@localhost ~]$ oerr ora 264
00264, 00000, "no recovery required"
// *Cause: An attempt was made to perform media recovery on files that do not // need any type of recovery.
// *Action: Do not attempt to perform media recovery on the selected
// files. Check to see that the filenames were entered properly.
// If not, retry the command with the proper filenames.
14、接下来就是open数据库,进行tx recover
Thu Mar 17 12:35:26 2011
alter database open
Thu Mar 17 12:35:26 2011
LGWR: Primary database is in CLUSTER CONSISTENT mode
Thu Mar 17 12:35:26 2011
ARCH: Evaluating archive log 2 thread 1 sequence 95760
ARCH: Beginning to archive log 2 thread 1 sequence 95760
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_95760.arc'
ARCH: Completed archiving log 2 thread 1 sequence 95760
Thu Mar 17 12:35:31 2011
LGWR: Primary database is in CLUSTER CONSISTENT mode
Thread 1 advanced to log sequence 98191
Thread 1 opened at log sequence 98191
Current log# 2 seq# 98191 mem# 0: /zzrkdata/zzrk/redo02.log
Successful open of redo thread 1.
Thu Mar 17 12:35:32 2011
SMON: enabling cache recovery
Thu Mar 17 12:35:32 2011
ARC0: Evaluating archive log 4 thread 1 sequence 98184
ARC0: Beginning to archive log 4 thread 1 sequence 98184
Creating archive destination LOG_ARCHIVE_DEST_1: '/zzrkarch/zzrk/arch_1_98184.arc'
Thu Mar 17 12:35:33 2011
Undo Segment 1 Onlined
Undo Segment 2 Onlined
Undo Segment 3 Onlined
Undo Segment 4 Onlined
Undo Segment 5 Onlined
Undo Segment 6 Onlined
Undo Segment 7 Onlined
Undo Segment 8 Onlined
Undo Segment 9 Onlined
Undo Segment 10 Onlined
Successfully onlined Undo Tablespace 1.
Thu Mar 17 12:35:33 2011
SMON: enabling tx recovery
Thu Mar 17 12:35:33 2011
Database Characterset is ZHS16GBK
Thu Mar 17 12:35:34 2011
SMON: about to recover undo segment 16
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
Thu Mar 17 12:35:34 2011
replication_dependency_tracking turned off (no async multimaster replication found)
Thu Mar 17 12:35:34 2011
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
Thu Mar 17 12:35:35 2011
Completed: alter database open
Thu Mar 17 12:35:35 2011
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
SMON: about to recover undo segment 16
SMON: mark undo segment 16 as available
。。。
SMON: Parallel transaction recovery tried
。。。
故障是解决了,但是郁闷的是最终通过重启主机来解决的。
发表评论
-
buffer cache 的内部结构
2020-03-18 14:21 579BUFFER CACHE作为数据块的 ... -
Oracle OMC介绍
2020-03-18 13:19 487Oracle管理云服务(OMC)的大数据平台,自动收集的企业 ... -
参加Oracle勒索病毒防范专题培训会议
2019-09-27 17:15 5132019年7月22日,受邀参加Oracle勒索病毒防范专题培训 ... -
记一次内存换IO的Oracle优化
2019-09-27 16:50 827某客户数据库从P595物理 ... -
如何定位Oracle SQL执行计划变化的原因
2019-07-03 14:49 1460性能优化最难的是能够 ... -
如何定位Oracle SQL执行计划变化的原因
2018-10-30 09:24 1185性能优化最难的是能够 ... -
数据库性能优化目标
2018-10-08 10:59 518从数据库性能优化的场 ... -
数据库无法打开的原因及解决办法
2018-10-05 20:45 2120数据库的启动是一个相当复杂的过程。比如,Oracle在启动之前 ... -
怎么样彻底删除数据库?
2018-09-18 11:10 599Oracle提供了drop database命令用来删除数据库 ... -
Oracle减少日志量的方法
2018-09-10 10:17 867LGWR进程将LOG BUFFER中的 ... -
如何快速关闭数据库
2018-09-09 13:14 1233“一朝被蛇咬,十年怕井绳”。在没被“蛇”咬之前,很多DBA喜欢 ... -
关于《如何落地智能化运维》PPT
2018-05-17 10:19 1129在DTCC 2018发表《如何落地智能化运维》演讲,主要内容如 ... -
记录在redhat5.8平台安装oracle11.2容易忽视的几个问题
2018-05-11 19:58 578问题一:ping不通问题 在虚拟机上安装好linux系统后, ... -
《Oracle DBA实战攻略》第一章
2018-05-11 10:42 947即日起,不定期更新《OracleDBA实战攻略》一书电子版,请 ... -
Oracle 12c新特性
2018-05-11 10:33 900查询所有pdb [oracle@gj4 ~]$ sqlplu ... -
关于修改memory_target的值后数据库无法启动的问题
2017-02-28 12:24 3983操作系统:RHEL6.5 数据库版本:11.2.0.4 ... -
10g rac安装error while loading shared libraries libpthread.so.0 问题
2017-02-28 12:22 69511g rac安装在二节点跑脚本一般会报此错误: 解决这个问 ... -
记一次Oracle会话共享模式故障处理过程
2017-02-27 19:16 801故障简述 XXX第八人民医院HIS数据库7月13日11点左右从 ... -
RESMGR:cpu quantum等待事件处理过程
2017-02-27 18:23 2615由于数据库上线过程中出现大量的RESMGR:cpu quant ... -
谈谈log file sync
2014-03-19 14:18 1759数据库中的log file sync等待事件指的是,当user ...
相关推荐
如果一次尝试未解决问题,需返回上一步重新考虑其他可能的解决方案。 6. 服务恢复:当故障被解决后,需要验证服务器是否已恢复正常工作。这包括测试关键服务的可用性、性能和稳定性。 7. 记录与报告:整理整个故障...
- **描述**:本报告详细记录了一次IT故障事件的处理流程,包括事件的发生、影响范围、处理过程以及最终解决方案。旨在为IT团队提供故障处理的参考案例。 #### 二、事件详情 - **事件描述**: - **用户反馈**:...
这种日常检查应至少每日一次,以确保所有设备设施处于良好状态。同时,检查内容应全面详实,包括设备的使用状况、环境条件和可能存在的问题。 其次,文档提醒维护人员需关注消防设备设施的保养与更换,尤其是那些易...
- **一组完整记录**:由两个寄存器(D)组成,例如D6200-D6201记录最近一次故障,D6202-D6203记录上一次故障等。 - **查看方式**: - 使用GHE-FMT板小键盘查看,具体操作方法参见《GHE-FMT板使用说明书》(文件号:S...
变频器现场调试与故障处理是工业电气设备维护和故障诊断中的一项重要技能,其主要涉及变频器在实际运行过程中可能遇到的各种故障的排查与修复。变频器是一种电力控制设备,它通过改变电机供电频率和电压来调节电机...
对于这类问题,需要对一次设备、保护动作信号等进行进一步检查,以明确具体原因。 除了线路和硬件问题,变电设备检查的不到位也可能造成跳闸故障。由于变电设备规模庞大,其日常检查与维护工作量繁重,这可能导致...
11. **故障处理**:发现问题时,检查人员须报告,并填写故障处理记录。问题应按紧急程度在规定时间内解决,处理结果需由消防安全责任人确认。 这些规定确保了建筑消防设施的高效运行,有效预防火灾风险。任何环节的...
Oracle MySQL数据库故障处理应急方案是一系列针对数据库在运行过程中可能遇到的锁等待、latchfree、cachebufferchains以及librarycachelock等高资源消耗和CPU负载问题的处理方法和步骤。该方案详细描述了上述问题的...
作者黄敏堂通过分析上思站一次自动站仪器故障的实际案例,分享了处理此类问题的经验和技巧,旨在帮助观测员提升故障处理能力。 【知识点详解】 1. 故障诊断:自动站仪器故障通常通过观察实时数据和状态的异常来...
在工业自动化领域,西门子S7-1200 PLC是广泛应用的控制器之一,而IO设备(输入/输出设备)则是与PLC交互的重要组成部分。当S7-1200 PLC出现“IO设备故障”报警,但在实际检查中所有IO模块都显示正常时,这通常意味着...
此外,每年应进行一次系统绝缘电阻及接地电阻的检查测试,对逆变控制部分进行一次电能质量和保护功能的检查和试验。 总结来说,分布式光伏发电系统的运行维护和故障处理是一项综合性的技术活动,需要定期地对系统...
- 在每次使用前,都需要进行一次全面的检查,确认设备可以正常工作。 - 特别注意检查电源线、插头等外部连接部件是否完好,避免因为这些简单问题导致设备无法启动。 #### 三、应急处理流程 1. **电源及线路故障*...
运维故障处理不仅是一项技术挑战,更是对运维团队整体协作能力的一次考验。通过上述讨论,我们可以看出,从日常的准备、故障发生时的快速响应到未来的智能化趋势,每一个环节都至关重要。只有建立起一套完整的运维...
文档“风机通讯故障处理.doc”记录了一次针对巴里坤风电场S4-9机组通讯故障的详细排查和处理过程。故障始于2021年4月16日,表现为41、42、43子站总线故障,主故障为41子站总线故障,伴随叶片数据丧失。在故障发生后...
查看事件记录的操作为M-1-2-1,而查看上一次接通电源前未清除的故障记录则是M-2-2-2。同样,清除故障的方法是按下SHIFT+5,然后按照提示确认。 4. OVF20电梯,通过M-2-2-1可以查看从上次电源接通到现在的所有事件,...
传输系统故障处理流程是保障通信网络稳定运行的关键环节,尤其对于一级、二级干线光缆传输系统,一旦出现重大故障或阻断,责任单位必须迅速向上级汇报并采取行动。业务主管应当亲临现场,由有业务领导权的单位统一...
1.2 日常巡检-基本原则 采用唯一的标识或名称 创建问题解决记录日志 一次只做一个改动,并记录结果 使用华为提供的工具、资源、软件 清楚操作系统和应用软件的更新 制定可靠的备份计划 现场保留备件,故障时及时...
在故障分析的基础上,进行故障处理时需要遵循一系列的方法和步骤,这不仅能够提高处理效率,还能最大程度地降低对铁路运输的影响。处理流程如下: 1. 立即响应:一旦检测到系统故障,迅速启动应急方案,确保行车...