Главная страница


ru.unix.bsd

 
 - RU.UNIX.BSD ------------------------------------------------------------------
 From : Eugene Grosbein                      2:5006/1       21 Feb 2005  18:52:55
 To : All
 Subject : scsi trouble
 -------------------------------------------------------------------------------- 
 
 Привет!
 
 Зайдя удаленно на 5.3-STABLE, запустил dump раздела /dev/da0s1d
 (занято 26Gb) в файл, лежащий на диске ATA. Hекоторое время dump работал,
 потом выдал кучу ошибок (device not configured) и предложил прекратить
 работу. После этого корневой каталог точки монтирования этого раздела
 выглядил пустым. Попытался отмонтировать раздел (его никто не использует) -
 потерял управление. Позвонил, тачку перегрузили питанием. Зашел,
 данные на месте. В логах (они пишутся на ATA) следующее:
 
 Feb 21 16:59:17 db kernel: ahd0: Recovery Initiated - Card was not paused
 Feb 21 16:59:17 db kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins
 <<<<<<<<<<<<<<<<<
 Feb 21 16:59:17 db kernel: ahd0: Dumping Card State at program address 0xaa
 Mode 0x33
 Feb 21 16:59:17 db kernel: INTSTAT[0x0] SELOID[0x0] SELID[0x0] HS_MAILBOX[0x0] 
 Feb 21 16:59:17 db kernel: INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11]
 DFFSTAT[0x31] 
 Feb 21 16:59:17 db kernel: SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0]
 LASTPHASE[0x1] 
 Feb 21 16:59:17 db kernel: SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10]
 SEQINTCTL[0x80] 
 Feb 21 16:59:17 db kernel: SEQ_FLAGS[0x0] SEQ_FLAGS2[0x0] QFREEZE_COUNT[0x1] 
 Feb 21 16:59:17 db kernel: KERNEL_QFREEZE_COUNT[0x1] MK_MESSAGE_SCB[0xff00]
 MK_MESSAGE_SCSIID[0xff] 
 Feb 21 16:59:17 db kernel: SSTAT0[0x0] SSTAT1[0x8] SSTAT2[0x0] SSTAT3[0x0]
 PERRDIAG[0x8] 
 Feb 21 16:59:17 db kernel: SIMODE1[0xa4] LQISTAT0[0x0] LQISTAT1[0x0]
 LQISTAT2[0x0] 
 Feb 21 16:59:17 db kernel: LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x1] 
 Feb 21 16:59:17 db kernel: 
 Feb 21 16:59:17 db kernel: SCB Count = 32 CMDS_PENDING = 4 LASTSCB 0x7 CURRSCB
 0x7 NEXTSCB 0xffc0
 Feb 21 16:59:17 db kernel: qinstart = 45961 qinfifonext = 45961
 Feb 21 16:59:17 db kernel: QINFIFO:
 Feb 21 16:59:17 db kernel: WAITING_TID_QUEUES:
 Feb 21 16:59:17 db kernel: Pending list:
 Feb 21 16:59:17 db kernel: 7 FIFO_USE[0x0] SCB_CONTROL[0x62] SCB_SCSIID[0x7] 
 Feb 21 16:59:17 db kernel: 2 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x7] 
 Feb 21 16:59:17 db kernel: 5 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x7] 
 Feb 21 16:59:17 db kernel: 8 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x7] 
 Feb 21 16:59:17 db kernel: Total 4
 Feb 21 16:59:17 db kernel: Kernel Free SCB list: 28 13 29 27 14 9 31 12 15 1 11
 0 10 30 4 6 3 26 25 24 23 22 21 20 19 18 17 16 
 Feb 21 16:59:17 db kernel: Sequencer Complete DMA-inprog list: 
 Feb 21 16:59:17 db kernel: Sequencer Complete list: 
 Feb 21 16:59:17 db kernel: Sequencer DMA-Up and Complete list: 
 Feb 21 16:59:17 db kernel: Sequencer On QFreeze and Complete list: 
 Feb 21 16:59:17 db kernel: 
 Feb 21 16:59:17 db kernel: 
 Feb 21 16:59:17 db kernel: ahd0: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x5
 Feb 21 16:59:17 db kernel: SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0]
 DFSTATUS[0x89] 
 Feb 21 16:59:17 db kernel: SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] 
 Feb 21 16:59:17 db kernel: SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT =
 0x0 
 Feb 21 16:59:17 db kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] 
 Feb 21 16:59:17 db kernel: 
 Feb 21 16:59:17 db kernel: ahd0: FIFO1 Free, LONGJMP == 0x8285, SCB 0x5
 Feb 21 16:59:17 db kernel: SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0]
 DFSTATUS[0x89] 
 Feb 21 16:59:17 db kernel: SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] 
 Feb 21 16:59:17 db kernel: SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT =
 0x0 
 Feb 21 16:59:17 db kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] 
 Feb 21 16:59:17 db kernel: LQIN: 0x55 0x0 0x0 0x5 0x0 0x0 0x0 0x0 0x0 0x0 0x0
 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 
 Feb 21 16:59:17 db kernel: ahd0: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE =
 0x42
 Feb 21 16:59:17 db kernel: ahd0: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x1
 Feb 21 16:59:17 db kernel: ahd0: SAVED_SCSIID = 0x0 SAVED_LUN = 0x0
 Feb 21 16:59:17 db kernel: SIMODE0[0xc] 
 Feb 21 16:59:17 db kernel: CCSCBCTL[0x4] 
 Feb 21 16:59:17 db kernel: ahd0: REG0 == 0x8, SINDEX = 0x133, DINDEX = 0x102
 Feb 21 16:59:17 db kernel: ahd0: SCBPTR == 0x7, SCB_NEXT == 0xffc0, SCB_NEXT2
 == 0xff25
 Feb 21 16:59:17 db kernel: CDB 28 0 0 37 c7 ff
 Feb 21 16:59:17 db kernel: STACK: 0x1 0x140 0x0 0x0 0x27e 0x285 0xa8 0x39
 Feb 21 16:59:17 db kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends
 
 >>>>>>>>>>>>>>>>>>
 
 Feb 21 16:59:17 db kernel: (da0:ahd0:0:0:0): SCB 0x2 - timed out
 Feb 21 16:59:17 db kernel: (da0:ahd0:0:0:0): Other SCB Timeout
 Feb 21 16:59:17 db kernel: (da0:ahd0:0:0:0): SCB 0x5 - timed out
 Feb 21 16:59:17 db kernel: (da0:ahd0:0:0:0): Other SCB Timeout
 Feb 21 16:59:17 db kernel: (da0:ahd0:0:0:0): SCB 0x8 - timed out
 Feb 21 16:59:17 db kernel: (da0:ahd0:0:0:0): Other SCB Timeout
 
 И такого добра около мегабайта в лог высыпало в течение 14 минут.
 В конце написало:
 
 Feb 21 17:13:15 db kernel: (da0:ahd0:0:0:0): SCB 0x37 - timed out
 Feb 21 17:13:15 db kernel: (da0:ahd0:0:0:0): no longer in timeout, status = 24b
 Feb 21 17:13:15 db kernel: ahd0: Issued Channel A Bus Reset. 4 SCBs aborted
 Feb 21 17:13:18 db kernel: Copied 18 bytes of sense data offset 12: 0x70 0x0
 0x6 0x0 0x0 0x0 0x0 0xa 0x0 0x0 0x0 0x0 0x29 0x2 0x2 0x0 0x0 0x0
 Feb 21 17:13:18 db kernel: (da0:ahd0:0:0:0): READ(10). CDB: 28 0 1 56 e 6b 0 0
 4 0 
 Feb 21 17:13:18 db kernel: (da0:ahd0:0:0:0): CAM Status: SCSI Status Error
 Feb 21 17:13:18 db kernel: (da0:ahd0:0:0:0): SCSI Status: Check Condition
 Feb 21 17:13:18 db kernel: (da0:ahd0:0:0:0): UNIT ATTENTION asc:29,2
 Feb 21 17:13:18 db kernel: (da0:ahd0:0:0:0): Scsi bus reset occurred field
 replaceable unit: 2
 Feb 21 17:13:18 db kernel: (da0:ahd0:0:0:0): Retrying Command (per Sense Data)
 Feb 21 17:13:44 db kernel: Removing MK_MSG scb
 Feb 21 17:13:47 db last message repeated 3 times
 Feb 21 17:13:47 db kernel: (da0:ahd0:0:0:0): lost device
 Feb 21 17:13:47 db kernel: (da0:ahd0:0:0:0): Invalidating pack
 Feb 21 17:13:57 db last message repeated 2 times
 
 После чего систему перегрузили питанием. Что это было?
 
 Eugene
 --- slrn/0.9.8.0 (FreeBSD)
  * Origin: Svyaz Service JSC (2:5006/1@fidonet)
 
 

Вернуться к списку тем, сортированных по: возрастание даты  уменьшение даты  тема  автор 

 Тема:    Автор:    Дата:  
 scsi trouble   Eugene Grosbein   21 Feb 2005 18:52:55 
 Re: scsi trouble   Eugene Grosbein   21 Feb 2005 19:12:50 
 Re: scsi trouble   Eugene Grosbein   21 Feb 2005 20:31:38 
 Re: scsi trouble   Denis Shaposhnikov   21 Feb 2005 17:45:44 
 scsi trouble   Slawa Olhovchenkov   21 Feb 2005 18:20:06 
 Re: scsi trouble   Eugene Grosbein   22 Feb 2005 10:03:36 
 scsi trouble   Slawa Olhovchenkov   22 Feb 2005 13:42:54 
 Re: scsi trouble   Eugene Grosbein   22 Feb 2005 18:02:01 
 scsi trouble   Slawa Olhovchenkov   22 Feb 2005 14:21:58 
 Re: scsi trouble   Eugene Grosbein   22 Feb 2005 18:39:05 
 Re: scsi trouble   Eugene Grosbein   22 Feb 2005 18:42:17 
 scsi trouble   Alexandr Oskolkov   21 Feb 2005 22:06:16 
Архивное /ru.unix.bsd/2609378d340d9.html, оценка 3 из 5, голосов 10
Яндекс.Метрика
Valid HTML 4.01 Transitional