
If you worry about losing data through hard-disk failures, RAID can put your mind at ease. RAID stands for (take your choice) redundant array of inexpensive/independent/-individual disks/drives. RAID techniques can also accelerate disk-drive data-transfer rates for such speed-intensive applications as video processing.
As Table 1 on pg 86 shows, a range of SCSI-interface RAID products is available for PCs and workstations. The products range from inexpensive controller boards and software packages for those who wish to configure their own RAID systems, to complete self-contained drive subsystems.
This article discusses products at the "low end" of the RAID spectrum, mainly sub-$15,000 systems that are likely to be of interest to designers. High-end systems (note the Core Array 2000 in Table 1), costing many tens of thousands of dollars, are more in the domain of MIS managers and network administrators.
RAID should be of interest to designers in large CAD systems. Joel Hagberg, RAID-product manager for Micropolis, cites the example of engineers sharing files over a network, a common scenario in many design facilities. RAID techniques are appropriate for such systems, both for faster data transfer and for fault tolerance. Although you can use tape-backup systems to provide fault tolerance, they have not kept pace with the advances in drive technology, according to Hagberg. Backing up with tape is agonizingly slow.
Though this article (and Table 1) deals with rotating-disk RAID systems, one emerging solid-state system is also worthy of mention. A 320-Mbyte flash-storage system, using RAID levels 3 and 5, is available from Raymond Engineering Inc. Designed for rugged environments (such as 1500g shock), the system is about the size of a large coffee mug. As usual, however, solid-state mass storage cannot compete with rotating media: The system, available in military versions now, costs $30,000 to $50,000.
What RAID level is best for you?
RAID accelerates data transfer through "striping"--distributing segments of the data across multiple drives. Two RAID techniques provide redundancy in systems requiring fault tolerance: "mirroring" and exclusive-OR parity architectures.
Fig 1 provides a simplified block diagram of the RAID levels in popular usage. RAID 0 (Fig 1a) uses pure striping to achieve high transfer rates. The stripes usually comprise large segments of data. For single-user applications that demand access to long sequential records, however, the stripes can be sector-sized (if the spindles are synchronized).
RAID 0 is the speediest and most disk-efficient of the levels. Its disadvantage is the total lack of redundancy--if one disk fails, the system fails. RAID 1 (Fig 1b), on the other hand, is the least disk-efficient but provides total storage redundancy. In RAID 1, identical data is mirrored to two disks. The disadvantages of RAID 1 are the expense (double the number of disks), and the fact that there's no performance improvement over a single-disk system.
Fig 1 does not illustrate RAID 2, which has fallen out of favor and become all but extinct. This level is identical to RAID 0--data is striped across multiple disks--except for the addition of drives that store error-correction code (ECC) information. Most modern disk drives come with embedded ECC, so RAID 2 serves no useful function.
RAID 3 (Fig 1c) uses striping and provides redundancy via a disk dedicated to storing parity information. In the case of a drive failure, the system recovers data by generating the exclusive OR of the data on the remaining drives. RAID 4 is structurally similar to RAID 3 but uses larger stripes, so that reads can originate from any drive (except the parity drive), thus allowing overlapping of read operations. Another difference between RAID 3 and 4: A RAID 3 array executes one I/O request at a time, involving all member disks equally, and a RAID 4 array can process multiple requests simultaneously, as long as the specified data maps to different member disks (Ref 1).
RAID 3 requires synchronized spindle drives to avoid performance degradation with short records. It offers good throughput rates for large data transfers but relatively poor media efficiency for small data blocks. RAID 3 is appropriate for imaging applications, for example.
A RAID 5 system (Fig 1d) spreads both the data and the parity information across all the drives. Unlike with RAID 3 or 4, each drive takes turns storing parity for a different series of stripes. Because there is no dedicated parity drive, it's possible to overlap read and write operations. The overhead in RAID 5 is the time it takes during write operations to regenerate the parity information. The result is a degradation of 40 to 66% in write speed, as compared with a RAID 1 (mirroring) array. RAID 5 is thus an appropriate choice in multiuser applications that require few write operations.
RAID 5 finds use in many more systems than does RAID 4. The fact that its parity chunks are distributed across multiple array members, rather than being concentrated on a dedicated parity disk, relieves the write bottleneck that plagues RAID 4. Table 2 summarizes the RAID levels and provides some indication of their respective suitability for particular applications.
Table 1 includes the terms "hot-pluggable," "hot-swappable," and "hot-spareable," which refer to the replacement of a failed drive. The RAID levels that use parity for fault tolerance provide transparent on-the-fly reconstruction of data in the event of a failed drive. In hot-pluggable, hot-swappable systems, you can remove and replace a failed drive without powering down the system. Systems that have a hot-spareable capability incorporate a spare drive that automatically replaces a failed drive after data reconstruction.
Many buses, operating systems
As Table 1 shows, available RAID products support multiple bus interfaces. They also support a variety of buses. As a couple of examples, AdvanSys SCSI host adapters are available for VESA and EISA slots; those from Distributed Processing Technology provide interfaces to ISA, EISA, and PCI buses. Most of the products also provide support for a wide range of operating systems. For example, the AdvanSys host adapters support DOS 4.0 or higher, Windows 3.1 or higher and NT, Novell NetWare 3.12 and 4.01, OS/2 2.1 and higher, SCO Unix 4.2 and higher, and Interactive Unix 3.2 or higher.
Because RAID techniques are appropriate for multiuser network applications, most of the products support such networking systems and software as Banyan Vines and Novell NetWare. Table 1 also shows that the transfer rate for most systems is 10 Mbytes/sec, characteristic of the Fast SCSI-2 host adapters. Some 20-Mbyte/sec products are available, thanks either to duplexing (for example, Megadrive's Power Array) or to the use of Fast and Wide SCSI-2 (for example, Megadrive's MR/10 and Peripheral Land Inc's QuickArray-W). With upcoming PCI-bus products, you can expect faster than 30-Mbyte/sec transfer rates. In the more distant future, look for better than 100-Mbyte/sec Fibre Channel systems.
The terms "software-based" and "hardware-based" in Table 1 also deserve some explanation. Software-based subsystems make use of the host system's CPU for disk-array management chores; the hardware-based systems contain their own dedicated processors. A popular choice is the i960, though some products use an ASIC RISC processor.
You can save some money by purchasing a software-based system, at the expense of speed. Micronet Technology's software- and hardware-based subsystems provide examples. The RISC µP in the hardware system relieves the system CPU of RAID chores and thus doubles the transfer rate, compared with the software system.
Build or buy?
For RAID systems, as with almost every aspect of system design, you face the decision: Roll your own or purchase? Hard-disk drives are relatively inexpensive and becoming more so all the time. If you have a lot of time, you could start from scratch, write your own RAID drive-management code, and configure your own controller and host adapter.
On the face of it, you'd undoubtedly save money, but you'd face a daunting design task. To quote from the release for AT&T/NCR's Series 3 controller, " the Series 3 controllers are driven by more than a quarter-million lines of embedded RAID software code." Starting not quite from scratch, you could purchase RAID software, such as Paragon 4.0 from BusLogic and configure your own host-adapter and drive-controller hardware. Ref 2 takes you step-by-step through building your own RAID-5 NetWare array, using Paragon 4.0 software.
However, configuring your own RAID subsystem can be scary. Greg Goelv, marketing director for Micropolis, offers the analogy of building a Rolls Royce from a kit rather than buying the car with a tuned engine. Goelv maintains that you're likely to run into trouble if, for example, you want to upgrade your system by adding more disk drives.
Goelv asserts that disk-drive manufacturers often make subtle design changes even for the same drive model, both in firmware and in SCSI code. As a result, you could end up with incompatibility headaches. RAID-subsystem manufacturers, on the other hand, guarantee backward compatibility for their expansion and upgrade products.
Many standards, buses, interfaces, and such go through incompatibility problems; sometimes, vague or ambiguous specs lead to interoperability headaches when you try to mate different manufacturers' products. To avoid such problems with RAID systems, the RAID Advisory Board has done an excellent job of setting standards and specs for RAID technology.
1. The RAIDBook, RAID Advisory Board. $9.95.
2."Building your own RAID-5 NetWare disk array", Chantal Systems, Division of BusLogic Inc.
3."What are disk arrays?" AT&T/NCR.
4."Why disk arrays?" Chantal Systems, Division of BusLogic Inc.
REPRESENTATIVE PC-BASED SCSI RAID CONTROLLERS AND SUBSYSTEMS |
||||||
|---|---|---|---|---|---|---|
| Company | Model no. | Product type | RAID levels |
Transfer rate (Mbytes/sec) |
Price | Comments |
| AdvanSys | ABP852 ABP742 ABP752 |
Two-channel SCSI host adapter for VESA slot One-channel SCSI host adapter for EISA slot Two-channel SCSI host adapter for EISA slot |
4, 5 4, 5 4, 5 |
10 10 20 |
$479 $699 $439 $599 $479 $699 |
Silver Kit: includes Corel SCSI Version 2 drivers Gold Kit: includes Corel Network Manager, Jukebox software Silver Kit: includes Corel SCSI Version 2 drivers Gold Kit: includes Corel Network Manager, Jukebox software Silver Kit: includes Corel SCSI Version 2 drivers Gold Kit: includes Corel Network manager, Jukebox software |
| AT&T/ NCR |
Series 3 | PCI-bus SCSI controller | 0, 1, 3, 5 | 10 | $3100 | Fast 20-SCSI version due in the fourth quarter; Fibre Channel version due in 1995 |
| BusLogic | DA-2988 DA-4998 DA-2788 Paragon 40 |
Two-channel PCI-bus SCSI controller Four-channel PCI-bus SCSI controller Two-channel EISA-bus SCSI controller Disk-array software for NetWare 40 |
0, 1, 3, 5 0, 1, 3, 5 0, 1, 3, 5 0, 1, 5 |
10 10 10 7 |
$1795 $2095 $1495 $995 |
All DA models include Disk-Array Manager network-management utility; 32-bit RISC µP controls bus-interface and DMA array-controller ASICs Provides failed-disk reconstruction, remote diagnostics |
| Core International |
Micro 2000 CORE Array 2000 |
5.25-in., 2-Gbyte RAID drive 2- to 20-Gbyte hardware subsystem |
3, 5 3, 5 |
10 10 |
$7999 $27,999 (20 Gbytes) |
All models offer optional hot-spare drive capability Designed for small networksand imaging workstations Designed for large networks |
| Distributed Processing Technology | DPT-XXXSC/2 Series DPT-XXXSC/4 Series DPT-XXXTWR/3 Series DPT-XXXTWR/6 Series DPT-XXXTWR/8 Series PM3XXX |
22-Gbyte dual drives for ISA/EISA/PCI bus 42-Gbyte dual drives for ISA/EISA/PCI bus 33-Gbyte dual drives for ISA/EISA/PCI bus 63-Gbyte dual drives for ISA/EISA/PCI bus 84-Gbyte dual drives for ISA/EISA/PCI bus RAID controllers for ISA/EISA/PCI bus |
0, 1, 5 0, 1, 5 0, 1, 5 0, 1, 5 0, 1, 5 0, 1, 5 |
10 10 10 10 10 10 |
$6975 to $7175 $8975 to $9175 $10,670 to $10,870 $13,670 to $13,870 $16,465 to $16,665 $1595 to $1795 |
All DPT models accommodate hot-spareable, hot-pluggable drives Monitors temperature, voltage of array cabinet and network server |
| Hewlett- Packard |
Disk System | Fast SCSI-2 RAID arrays; contain single to quintuple disk modules; 0.51 to 2 Gbytes/module | 0, 1, 5 | 10 | $3019 to $15,399 | Works with C3609A ($2699) dual-channel EISA disk-array controller and bus adapters D2078B (EISA, $445), C3605A (ISA, $502), C3606A (MCA, $703) |
| Industrial Computer Source | MAI00500 MAI00800 |
5.25-in., 510-Mbyte RAID drive 5.25-in., 840-Mbyte RAID drive 5.25-in., 510-Mbyte RAID drive |
3, 5 3, 5 3, 5 |
10 10 10 |
$4795 $5995 $7995 |
All MAI models use 25-Mhz RISC CPU, 2-Mbyte cache; interface with SCSI-2 controller |
| Legacy Storage Systems | DAC960 SmartArray XE |
SCSI-2 RAID controller for one, two, three, or five buses 12-drive RAID array, storage to 5.2 Gbytes |
0, 1, 5 0, 1, 5 |
10 NA |
$5440 (5-channel) $8495 (2 Gbytes) |
Uses i960 µP and accommodates cache to 64 Mbytes; comes with storage-management diagnostics utility Software-based RAID system; transfer rate dependent on system; hardware configuration with DAC960 yields 10-Mbyte/sec transfer rate |
| Lomas Data Products | LDP Cache IV | ISA-bus, SCSI-interface disk controller | 1 | 5 | $795 | Controls SCSI drives, presents ISA-bus (WD1003) interface to system; supports 2 to 16 Mbytes of user-installed cache |
| Megadrive | Power Array Family MR/10 |
Single and duplexed Fast SCSI-2 RAID arrays, 0.535 to 8 Gbytes Fast and Wide SCSI-2 RAID arrays, 2.6 to 20 Gbytes |
0, 1, 5 0, 3, 5 |
10 (20 duplexed) 20 |
from $2899 from $14,360 |
Uses removable, hot-swappable drive modules; supports single and parallel dual-ported host adapters Uses removable, hot-swappable drive modules; includes GUI for system monitoring, on-screen programming |
| Micronet Technology | Raidbank Raidband Plus |
Two-channel software-based RAID subsystem One- and Two-channel RAID subsystem |
0, 1, 5 0, 1, 5 hardware- based |
64 112 |
$4395 $5995 (one-channel) $6745 (two-channel) |
Accomodates drive modules from 0.53 to 2 Gbytes ($1295 to $2695) for storage of 1 to 28 Gbytes per cabinet Uses i960 RISC µP; contains 4 Mbytes of cache RAM, expandable to 64 Mbytes; accommodates drive modules from 0.53 to 2 Gbytes ($1295 to $2695) for storage of 1 to 12 Gbytes per cabinet |
| Micropolis | Raidion Series | 3.5- and 5.2 Gbyte hardware-based Wide SCSI-2 arrays |
5 Plus | 20 | $11,295 (3.5 Gbytes) $13,795 (5.2 Gbytes) |
Uses Gandiva controller board with 32-bit RISC µP on internal PCI bus; 10-Mbyte/sec software-based Raidion arrays also available with $495 RaidWare for Novell NetWare systems |
| Mylex | DAC960P | PCI-bus RAID controller board | 0, 1, 5 | 30 | $1898 | Uses i960 RISC µP; accommadates 2 to 32 Mbytes of DRAM of EDRAM cache; provides on-the-fly reconstruction, support for swappable, hot-spareable drives |
| Peripheral Land Inc | QuickArray R RAID drives QuickArray-W QuickArray Kit |
1-, 2-, 4-Gbyte Fast SCSI-2 1-, 2-, 4-Gbyte Fast and Wide SCSI RAID drives SCSI-2 controller and two-drive bay |
0, 1 0, 1 |
10 20 |
$2842 (1 Gbyte) $3081 (2 Gbyte) $5170 (4 Gbyte) $3025 (1 Gbyte) $3290 (2 Gbyte) $5625 (4 Gbyte) $1495 |
QuickArrays appear as a single SCSI drive to host system; can be daisy-chained to 28 Gbytes of storage User supplies own drives |
ATTRIBUTES OF RAID LEVELS 0 TO 5 |
|||
|---|---|---|---|
| RAID level | Structure | Unsuitable applications | Suitable applications |
| 0 | Pure striping, no parity | Those making sequential requests for small amounts of data Those making synchronous random requests for small amounts of data Those requiring fault tolerance |
Program-image libraries or runtime libraries for rapid loading Those needing rapid access to large tables or other read-only files Data collection from external sources at very high data-transfer rates |
| 1 | Pure mirroring | Cost-sensitive systems | Read-intensive, fault-tolerant systems |
| 2 | Striping with ECC | All | None |
| 3 | Striping with parity disk | Those requesting small blocks of data | Those requesting large blocks of data Noninteractive applications that process large files sequentially |
| 4 | Striping with parity disk | Those with many write operations | Those using mainly read requests Those that require high I/O-request rates vs high data-transfer rates |
| 5 | Striping with distributed parity | Data collection from processes Continually updated systems |
Those with high read-request rates, with a low percentage of write requests Transaction processing, office automation, on-line customer service |
| Manufacturers of raid controllers and subsystems | ||
|---|---|---|
| For free information on the signal-processing products discussed in this article, circle the appropriate numbers on the postage-paid Information Retrieval Service card or use EDN's Express Request service. When you contact any of the following manufacturers directly, please let them know you read about their products in EDN. | ||
| AdvanSys San Jose, CA (408) 383-9400 |
AT&T/NCR Dayton, OH (800) 334-5454 |
BusLogic Inc Santa Clara, CA (408) 492-9090 |
| Core International Boca Raton, FL (407) 997-6055 |
Distributed Processing Technology Maitland, FL (407) 830-5522 |
Hewlett-Packard Co Palo Alto, CA (800) 826-4111 |
| Industrial Computer Source San Diego, CA (800) 523-2320 |
Legacy Storage Systems Markham, ON, Canada (905) 475-1077 |
Lomas Data Products Inc Marlboro, MA (508) 460-0333 |
| Megadrive Systems Inc Beverly Hills, CA (310) 247-0006 |
Micronet Technology Inc Irvine, CA (714) 453-6000 |
Micropolis Corp Chatsworth, CA (818) 709-3300 |
| Mylex Corp Fremont, CA (510) 796-6100 |
Peripheral Land Inc Fremont, CA (510) 657-2211 |
RAID Advisory Board Technology Forums Line Lakes, MN (612) 784-2379 |
| Raymond Engineering Inc Middletown, CT (203) 632-4643 |
||