EDN logo


Design Feature: November 23, 1994

SCSI-based RAID systems provide storage redundancy, enhance data-transfer rate

Bill Travis,
Senior Technical Editor

RAID systems for PCs can greatly increase storage MTBF, accelerate read/write data-transfer rates, or both.

If you worry about losing data through hard-disk failures, RAID can put your mind at ease. RAID stands for (take your choice) redundant array of inexpensive/independent/-individual disks/drives. RAID techniques can also accelerate disk-drive data-transfer rates for such speed-intensive applications as video processing.

As Table 1 on pg 86 shows, a range of SCSI-interface RAID products is available for PCs and workstations. The products range from inexpensive controller boards and software packages for those who wish to configure their own RAID systems, to complete self-contained drive subsystems.

This article discusses products at the "low end" of the RAID spectrum, mainly sub-$15,000 systems that are likely to be of interest to designers. High-end systems (note the Core Array 2000 in Table 1), costing many tens of thousands of dollars, are more in the domain of MIS managers and network administrators.

RAID should be of interest to designers in large CAD systems. Joel Hagberg, RAID-product manager for Micropolis, cites the example of engineers sharing files over a network, a common scenario in many design facilities. RAID techniques are appropriate for such systems, both for faster data transfer and for fault tolerance. Although you can use tape-backup systems to provide fault tolerance, they have not kept pace with the advances in drive technology, according to Hagberg. Backing up with tape is agonizingly slow.

Though this article (and Table 1) deals with rotating-disk RAID systems, one emerging solid-state system is also worthy of mention. A 320-Mbyte flash-storage system, using RAID levels 3 and 5, is available from Raymond Engineering Inc. Designed for rugged environments (such as 1500g shock), the system is about the size of a large coffee mug. As usual, however, solid-state mass storage cannot compete with rotating media: The system, available in military versions now, costs $30,000 to $50,000.


What RAID level is best for you?

RAID accelerates data transfer through "striping"--distributing segments of the data across multiple drives. Two RAID techniques provide redundancy in systems requiring fault tolerance: "mirroring" and exclusive-OR parity architectures.

Fig 1 provides a simplified block diagram of the RAID levels in popular usage. RAID 0 (Fig 1a) uses pure striping to achieve high transfer rates. The stripes usually comprise large segments of data. For single-user applications that demand access to long sequential records, however, the stripes can be sector-sized (if the spindles are synchronized).

RAID 0 is the speediest and most disk-efficient of the levels. Its disadvantage is the total lack of redundancy--if one disk fails, the system fails. RAID 1 (Fig 1b), on the other hand, is the least disk-efficient but provides total storage redundancy. In RAID 1, identical data is mirrored to two disks. The disadvantages of RAID 1 are the expense (double the number of disks), and the fact that there's no performance improvement over a single-disk system.

Fig 1 does not illustrate RAID 2, which has fallen out of favor and become all but extinct. This level is identical to RAID 0--data is striped across multiple disks--except for the addition of drives that store error-correction code (ECC) information. Most modern disk drives come with embedded ECC, so RAID 2 serves no useful function.

RAID 3 (Fig 1c) uses striping and provides redundancy via a disk dedicated to storing parity information. In the case of a drive failure, the system recovers data by generating the exclusive OR of the data on the remaining drives. RAID 4 is structurally similar to RAID 3 but uses larger stripes, so that reads can originate from any drive (except the parity drive), thus allowing overlapping of read operations. Another difference between RAID 3 and 4: A RAID 3 array executes one I/O request at a time, involving all member disks equally, and a RAID 4 array can process multiple requests simultaneously, as long as the specified data maps to different member disks (Ref 1).

RAID 3 requires synchronized spindle drives to avoid performance degradation with short records. It offers good throughput rates for large data transfers but relatively poor media efficiency for small data blocks. RAID 3 is appropriate for imaging applications, for example.

A RAID 5 system (Fig 1d) spreads both the data and the parity information across all the drives. Unlike with RAID 3 or 4, each drive takes turns storing parity for a different series of stripes. Because there is no dedicated parity drive, it's possible to overlap read and write operations. The overhead in RAID 5 is the time it takes during write operations to regenerate the parity information. The result is a degradation of 40 to 66% in write speed, as compared with a RAID 1 (mirroring) array. RAID 5 is thus an appropriate choice in multiuser applications that require few write operations.

RAID 5 finds use in many more systems than does RAID 4. The fact that its parity chunks are distributed across multiple array members, rather than being concentrated on a dedicated parity disk, relieves the write bottleneck that plagues RAID 4. Table 2 summarizes the RAID levels and provides some indication of their respective suitability for particular applications.

Table 1 includes the terms "hot-pluggable," "hot-swappable," and "hot-spareable," which refer to the replacement of a failed drive. The RAID levels that use parity for fault tolerance provide transparent on-the-fly reconstruction of data in the event of a failed drive. In hot-pluggable, hot-swappable systems, you can remove and replace a failed drive without powering down the system. Systems that have a hot-spareable capability incorporate a spare drive that automatically replaces a failed drive after data reconstruction.


Many buses, operating systems

As Table 1 shows, available RAID products support multiple bus interfaces. They also support a variety of buses. As a couple of examples, AdvanSys SCSI host adapters are available for VESA and EISA slots; those from Distributed Processing Technology provide interfaces to ISA, EISA, and PCI buses. Most of the products also provide support for a wide range of operating systems. For example, the AdvanSys host adapters support DOS 4.0 or higher, Windows 3.1 or higher and NT, Novell NetWare 3.12 and 4.01, OS/2 2.1 and higher, SCO Unix 4.2 and higher, and Interactive Unix 3.2 or higher.

Because RAID techniques are appropriate for multiuser network applications, most of the products support such networking systems and software as Banyan Vines and Novell NetWare. Table 1 also shows that the transfer rate for most systems is 10 Mbytes/sec, characteristic of the Fast SCSI-2 host adapters. Some 20-Mbyte/sec products are available, thanks either to duplexing (for example, Megadrive's Power Array) or to the use of Fast and Wide SCSI-2 (for example, Megadrive's MR/10 and Peripheral Land Inc's QuickArray-W). With upcoming PCI-bus products, you can expect faster than 30-Mbyte/sec transfer rates. In the more distant future, look for better than 100-Mbyte/sec Fibre Channel systems.

The terms "software-based" and "hardware-based" in Table 1 also deserve some explanation. Software-based subsystems make use of the host system's CPU for disk-array management chores; the hardware-based systems contain their own dedicated processors. A popular choice is the i960, though some products use an ASIC RISC processor.

You can save some money by purchasing a software-based system, at the expense of speed. Micronet Technology's software- and hardware-based subsystems provide examples. The RISC µP in the hardware system relieves the system CPU of RAID chores and thus doubles the transfer rate, compared with the software system.


Build or buy?

For RAID systems, as with almost every aspect of system design, you face the decision: Roll your own or purchase? Hard-disk drives are relatively inexpensive and becoming more so all the time. If you have a lot of time, you could start from scratch, write your own RAID drive-management code, and configure your own controller and host adapter.

On the face of it, you'd undoubtedly save money, but you'd face a daunting design task. To quote from the release for AT&T/NCR's Series 3 controller, "…the Series 3 controllers are driven by more than a quarter-million lines of embedded RAID software code." Starting not quite from scratch, you could purchase RAID software, such as Paragon 4.0 from BusLogic and configure your own host-adapter and drive-controller hardware. Ref 2 takes you step-by-step through building your own RAID-5 NetWare array, using Paragon 4.0 software.

However, configuring your own RAID subsystem can be scary. Greg Goelv, marketing director for Micropolis, offers the analogy of building a Rolls Royce from a kit rather than buying the car with a tuned engine. Goelv maintains that you're likely to run into trouble if, for example, you want to upgrade your system by adding more disk drives.

Goelv asserts that disk-drive manufacturers often make subtle design changes even for the same drive model, both in firmware and in SCSI code. As a result, you could end up with incompatibility headaches. RAID-subsystem manufacturers, on the other hand, guarantee backward compatibility for their expansion and upgrade products.

Many standards, buses, interfaces, and such go through incompatibility problems; sometimes, vague or ambiguous specs lead to interoperability headaches when you try to mate different manufacturers' products. To avoid such problems with RAID systems, the RAID Advisory Board has done an excellent job of setting standards and specs for RAID technology.


References

1. The RAIDBook, RAID Advisory Board. $9.95.

2."Building your own RAID-5 NetWare disk array", Chantal Systems, Division of BusLogic Inc.

3."What are disk arrays?" AT&T/NCR.

4."Why disk arrays?" Chantal Systems, Division of BusLogic Inc.


REPRESENTATIVE PC-BASED SCSI RAID CONTROLLERS AND SUBSYSTEMS

Company Model no. Product type RAID
levels
Transfer rate
(Mbytes/sec)
Price Comments
AdvanSys ABP852





ABP742





ABP752
Two-channel SCSI host adapter for VESA slot



One-channel SCSI host adapter for EISA slot




Two-channel SCSI host adapter for EISA slot
4, 5





4, 5





4, 5
10





10





20
$479


$699


$439


$599


$479


$699
Silver Kit: includes Corel SCSI Version 2 drivers

Gold Kit: includes Corel Network Manager, Jukebox software

Silver Kit: includes Corel SCSI Version 2 drivers

Gold Kit: includes Corel Network Manager, Jukebox software

Silver Kit: includes Corel SCSI Version 2 drivers

Gold Kit: includes Corel Network manager, Jukebox software
AT&T/
NCR
Series 3 PCI-bus SCSI controller 0, 1, 3, 5 10 $3100 Fast 20-SCSI version due in the fourth quarter; Fibre Channel version due in 1995
BusLogic DA-2988



DA-4998

DA-2788



Paragon 40

Two-channel PCI-bus SCSI controller


Four-channel PCI-bus SCSI controller
Two-channel EISA-bus SCSI controller

Disk-array software for NetWare 40
0, 1, 3, 5



0, 1, 3, 5

0, 1, 3, 5



0, 1, 5

10



10

10



7

$1795



$2095

$1495



$995

All DA models include Disk-Array Manager network-management utility; 32-bit RISC µP controls bus-interface and DMA array-controller ASICs





Provides failed-disk reconstruction, remote diagnostics
Core
International
Micro 2000




CORE Array 2000
5.25-in., 2-Gbyte RAID drive



2- to 20-Gbyte hardware subsystem
3, 5




3, 5
10




10
$7999




$27,999
(20 Gbytes)
All models offer optional hot-spare drive capability
Designed for small networksand imaging workstations

Designed for large networks
Distributed Processing Technology DPT-XXXSC/2 Series
DPT-XXXSC/4 Series
DPT-XXXTWR/3 Series

DPT-XXXTWR/6 Series

DPT-XXXTWR/8 Series

PM3XXX
22-Gbyte dual drives for ISA/EISA/PCI bus
42-Gbyte dual drives for ISA/EISA/PCI bus
33-Gbyte dual drives for ISA/EISA/PCI bus

63-Gbyte dual drives for ISA/EISA/PCI bus

84-Gbyte dual drives for ISA/EISA/PCI bus

RAID controllers for ISA/EISA/PCI bus
0, 1, 5

0, 1, 5

0, 1, 5


0, 1, 5


0, 1, 5


0, 1, 5
10

10

10


10


10


10
$6975 to $7175

$8975 to $9175

$10,670 to $10,870

$13,670 to $13,870

$16,465 to $16,665

$1595 to $1795
All DPT models accommodate hot-spareable, hot-pluggable drives











Monitors temperature, voltage of array cabinet and network server
Hewlett-
Packard
Disk System Fast SCSI-2 RAID arrays; contain single to quintuple disk modules; 0.51 to 2 Gbytes/module 0, 1, 5 10 $3019 to $15,399 Works with C3609A ($2699) dual-channel EISA disk-array controller and bus adapters D2078B (EISA, $445), C3605A (ISA, $502), C3606A (MCA, $703)
Industrial Computer Source MAI00500


MAI00800
5.25-in., 510-Mbyte RAID drive

5.25-in., 840-Mbyte RAID drive
5.25-in., 510-Mbyte RAID drive
3, 5


3, 5

3, 5
10


10

10
$4795


$5995

$7995
All MAI models use 25-Mhz RISC CPU, 2-Mbyte cache; interface with SCSI-2 controller
Legacy Storage Systems DAC960




SmartArray XE
SCSI-2 RAID controller for one, two, three, or five buses


12-drive RAID array, storage to 5.2 Gbytes
0, 1, 5




0, 1, 5
10




NA
$5440
(5-channel)




$8495
(2 Gbytes)
Uses i960 µP and accommodates cache to 64 Mbytes; comes with storage-management diagnostics utility

Software-based RAID system; transfer rate dependent on system; hardware configuration with DAC960 yields 10-Mbyte/sec transfer rate
Lomas Data Products LDP Cache IV ISA-bus, SCSI-interface disk controller 1 5 $795 Controls SCSI drives, presents ISA-bus (WD1003) interface to system; supports 2 to 16 Mbytes of user-installed cache
Megadrive Power Array Family



MR/10
Single and duplexed
Fast SCSI-2 RAID arrays, 0.535 to 8 Gbytes

Fast and Wide
SCSI-2 RAID arrays, 2.6 to 20 Gbytes
0, 1, 5




0, 3, 5
10
(20 duplexed)




20
from $2899




from $14,360
Uses removable, hot-swappable drive modules; supports single and parallel dual-ported host adapters


Uses removable, hot-swappable drive modules; includes GUI for system monitoring, on-screen programming
Micronet Technology Raidbank




Raidband Plus
Two-channel software-based RAID subsystem


One- and Two-channel RAID subsystem
0, 1, 5




0, 1, 5 hardware-
based
64




112
$4395




$5995
(one-channel)
$6745
(two-channel)
Accomodates drive modules from 0.53 to 2 Gbytes ($1295 to $2695) for storage of 1 to 28 Gbytes per cabinet

Uses i960 RISC µP; contains 4 Mbytes of cache RAM, expandable to 64 Mbytes; accommodates drive modules from 0.53 to 2 Gbytes ($1295 to $2695) for storage of 1 to 12 Gbytes per cabinet
Micropolis Raidion Series 3.5- and 5.2 Gbyte hardware-based
Wide SCSI-2 arrays
5 Plus 20 $11,295
(3.5 Gbytes)
$13,795
(5.2 Gbytes)
Uses Gandiva controller board with 32-bit RISC µP on internal PCI bus; 10-Mbyte/sec software-based Raidion arrays also available with $495 RaidWare for Novell NetWare systems
Mylex DAC960P PCI-bus RAID controller board 0, 1, 5 30 $1898 Uses i960 RISC µP; accommadates 2 to 32 Mbytes of DRAM of EDRAM cache; provides on-the-fly reconstruction, support for swappable, hot-spareable drives
Peripheral Land Inc QuickArray R

RAID drives

QuickArray-W


QuickArray Kit
1-, 2-, 4-Gbyte
Fast SCSI-2


1-, 2-, 4-Gbyte
Fast and Wide SCSI
RAID drives
SCSI-2 controller and two-drive bay
0, 1



0, 1
10



20
$2842 (1 Gbyte)
$3081 (2 Gbyte)
$5170 (4 Gbyte)

$3025 (1 Gbyte)
$3290 (2 Gbyte)
$5625 (4 Gbyte)
$1495
QuickArrays appear as a single SCSI drive to host system; can be daisy-chained to 28 Gbytes of storage



User supplies own drives


ATTRIBUTES OF RAID LEVELS 0 TO 5

RAID level Structure Unsuitable applications Suitable applications
0 Pure striping, no parity Those making sequential requests for small amounts of data
Those making synchronous random requests for small amounts of data
Those requiring fault tolerance
Program-image libraries or runtime libraries for rapid loading
Those needing rapid access to large tables or other read-only files
Data collection from external sources at very high data-transfer rates
1 Pure mirroring Cost-sensitive systems Read-intensive, fault-tolerant systems
2 Striping with ECC All None
3 Striping with parity disk Those requesting small blocks of data Those requesting large blocks of data
Noninteractive applications that process large files sequentially
4 Striping with parity disk Those with many write operations Those using mainly read requests
Those that require high I/O-request rates vs high data-transfer rates
5 Striping with distributed parity Data collection from processes
Continually updated systems
Those with high read-request rates, with a low percentage of write requests
Transaction processing, office automation, on-line customer service


Manufacturers of raid controllers and subsystems
For free information on the signal-processing products discussed in this article, circle the appropriate numbers on the postage-paid Information Retrieval Service card or use EDN's Express Request service. When you contact any of the following manufacturers directly, please let them know you read about their products in EDN.
AdvanSys
San Jose, CA
(408) 383-9400
AT&T/NCR
Dayton, OH
(800) 334-5454
BusLogic Inc
Santa Clara, CA
(408) 492-9090
Core International
Boca Raton, FL
(407) 997-6055
Distributed Processing Technology
Maitland, FL
(407) 830-5522
Hewlett-Packard Co
Palo Alto, CA
(800) 826-4111
Industrial Computer Source
San Diego, CA
(800) 523-2320
Legacy Storage Systems
Markham, ON, Canada
(905) 475-1077
Lomas Data Products Inc
Marlboro, MA
(508) 460-0333
Megadrive Systems Inc
Beverly Hills, CA
(310) 247-0006
Micronet Technology Inc
Irvine, CA
(714) 453-6000
Micropolis Corp
Chatsworth, CA
(818) 709-3300
Mylex Corp
Fremont, CA
(510) 796-6100
Peripheral Land Inc
Fremont, CA
(510) 657-2211
RAID Advisory Board
Technology Forums

Line Lakes, MN
(612) 784-2379
Raymond Engineering Inc
Middletown, CT
(203) 632-4643


| EDN Access | feedback | subscribe to EDN! |
| design features | design ideas | columnist |


Copyright © 1995 EDN Magazine. EDN is a registered trademark of Reed Properties Inc, used under license.