Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 1 of 42
CAN application in avionics Final Report ESTEC Omnisys Instruments AB Dr. Anders Emrich
This report look into various aspects of using the CAN-bus in the space environment. The main focus is on the electrcial interface and distribution, but some discussion regarding the high level protocol aspects is also included. Three implementations of the electrical interface are discussed and one of them has been evaluted with a hardware breadboard. Spice simulations has been performed for two of the implementations.. It seems quite clear that for a majority of applications, the ISO high speed, differential bus is to be prefered. If high voltage isolation, it is possible to consider local opto isolators without changing the rest of the system. For voltage isolation between several subsystems, but the distance being short, the transformer isolation could be considered, but it requires a non standard modulation on the bus.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 2 of 42
Introduction............................................................................................................ 5
1.1 1.2 1.3 Overview........................................................................................................... 5 Basic Technical Description of Standard CAN............................................... 6 Suggestion on selection of bus type.................................................................... 6
System Redundancy.............................................................................................. 29
6.1 6.2 Cold Redundancy .............................................................................................29 Warm Redundancy ..........................................................................................31
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 3 of 42
Introduction.....................................................................................................38 Basic assumption and preferences ....................................................................38 Essential Communication.................................................................................40 Payload Communication ..................................................................................40 System priority.................................................................................................40 Start-up ............................................................................................................41
9 10
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 4 of 42
List of Abbreviations
ASIC Application Specific Integrated Circuit CAN Controller Area Network ESA European Space Agency FPGA Field Programmable Gate Array ISO International Standard Organisation OBDH On Board Data Handling SMART-1 ESA satellite TBC To Be Confirmed TBD To Be Defined VHDL VHSIC Hardware Description Language UART Universal Asynchronous Receiver Transmitter WP Work Package WPD Work Package Description
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 5 of 42
1
1.1
Introduction
Overview
Modern satellites use either point to point communication or one of several buses for subsystem communication. The most common buses in use are:
MIL-STD-1553 or 1773 for most US satellites and some ESA satellites ESA OBDH on some ESA satellites
These communication buses have several drawbacks, the common ones are: Single master type (requires critical and awkward central planning, tricky to implement redundancy) High power consumption Costly development support Furthermore, in the case of OBDH there is virtually no hardware or software available today. An alternative to the MIL-STD-1553 and the ESA OBDH that could be considered is the adaption of a standard commercial bus for use in space. One such bus with proven reliability and with an abundance of development tools is the CAN-bus, currently being used in some 90 % of all new cars and the most common bus for modern industry automation projects. There are however two problems that must be solved when considering using the CAN-bus in space: 1: Availability of radiation tolerant chips, 2: Support for hardware redundancy. (1) can be solved by using modern FPGAs that now have enough complexity to support a complete CAN-controller, while a concept for (2) exist, originating from work performed during the SMART-1 phase B. MIL-STD-1553: HighRel type radiation hard components exist single master type of bus widely used in military systems and US space projects fair development support transformer coupled high power consumption.
OBDH bus: HighRel type radiation hard components exist single master type of bus only used in some ESA space projects limited development support transformer coupled medium power consumption
CAN-bus: Used in automotive and industrial applications multimaster type of bus no radiation tolerant components available but FPGA implementations possible excellent development support choice of physical implementation, opto coupler, transformer etc. very flexible very reliable (sent bit check, stuff bits, CRC etc.) Direct point to point interfaces complicated redundancy scheme
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 6 of 42
1.2
This medium access method is very efficient because no bandwidth is lost during arbitration. Bus throughput is high under both light and heavy traffic conditions, reaching 1,000 msgs/s at 125Kbps and 8,000 msgs/s at 1Mbps. CAN provides five error detection mechanisms, including a 15-bit cyclic redundancy check (CRC) code that detects nearly all potential message bit errors. The CAN protocol has its own limitations. Because CAN nodes must listen to the bus while transmitting, the bit length must be at least twice the propagation delay. Therefore, high speeds are only supported for short buses (500m for 125 Kbps, 100m for 500Kbps, and 50m for 1Mbps). Some applications require electrical isolation between the bus and nodes. Bit/s: Msgs/s: Package: Behavior: 100k-1M 8000 (at 1 Mbit/s) 8 bytes deterministic
1.3
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 7 of 42
2.1
Number of buses
A very early question for the design of satellites is the type and number of buses to use for control and data read-out. It can happen that a decision is made to reduce this, even down to one bus, very early in the design process. The result from this is not only electrical, it also involves the overall project organisation, i.e. the organisation structure must reflect how many parties needs to be coordinated. Based on experience, at least two different buses should be considered, one essential and one for the payload. The reason is both organisational and physical. The essential subsystem is commonly pure industrial with its participants, i.e. formal specifications, test procedures etc. In addition, there is a lot of re-use from project to project in this area. By contrast, the payload is very often a new development for this project with no or very little previous experience. Furthermore, for science missions, it can be that a large fraction of the organisation can be from the university / research institute side with very limited organisation and qualification for a space project. The essential side can have more limited data transfer needs compared to the payload, however, the need for real-time response is obvious in most systems. The CAN-bus will meet the requirement for most essential systems. The payload requirements on the other hand will vary from mission to mission, but it is common that a lot of data is transferred to ground for further processing, or in the case of a communication satellite, there is a lot of data in both directions. It depends on the payload, but the CAN bus can meet the requirements for several types as the download bandwidth is often less the CAN bandwidth of some 500 kbit/s. The use may require the use of local buffers and/or a different system planning and layout.
2.2
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 8 of 42
Earth Sens
Gyro CNT
Star tracker
Earth Sensor
CAN
TM/TC
S-band Rx S-band Tx
Auxillary I/O
1 hot+ 1 cold
Passive CAN
Experiment A
Experiment B
Experiment C
Experiment D
Figure 2. SMART 1 satellite block level sketch for the control and communication planning
The system is quite straightforward with the major items being: Two logical buses, one for the essential side and one for the payload side Two system controllers, one is redundant Two payload controllers, one is redundant Two TM/TC units, connected two both buses Several subsystems on both buses Furthermore, on the low level electrical side, there is no very sensitive device for EMC on the TM/TC interface, nor any extreme high voltage subsystems (the Ion drive will have a few houndred volts). Furthermore, the satellite is quite small and the systems will not be separated with more than 0.1-2 meter. This means that there is no real requirement for galvanic isolation in this application, as long as the engineering is done right. If one or two intruments would really need isolation, and / or the rest of the satellite must be isolated from them, this could be solved with local isolation, not seen on system level.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 9 of 42
3
3.1
3.2
3.3
3.4
Error rate
The error handling of CAN is one of the really strong advantages of the protocol. The error detection mechanisms are extensive, and the fault confinement algorithms are well developed. The error handling and retransmission of the messages is done automatically by the CAN hardware.
3.5
3.6
Robustness to radiation
If an FPGA is used, there should be no problems with either SEL or total dose for most missions. An Actel SX32 for instance is SEL immune and tolerates 100 kRAD and a CAN cores consumes about 50-60 % of the device.
3.7
Robustness to vibration
The robustness to vibration is not very severe and should be comparable to the use of standard components, connector and cables. If the CAN controller is implemented in a FPGA, is will probably have a quite large package that dominates as the worst component. Special care will have to be taken when mounting the package, as with any other large component. The use of connectors and cables will not be more severe compared to other interconnects on the satellite as few conductors are needed. If a modern plastic package is used, the potential vibration problems more or less disappears for the devices.
3.8
Robustness to temperature
A majority of commercial CAN components (controllers, transceivers) have operational temperature ranges between +125 C and -40 C which is within satisfactory limits for space applications.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 10 of 42
3.9
3.10
3.11
Transceivers: ex. Philips PCA82C251 CAN controller interface Philips TJA1050 CAN controller interface Alcatel MTC-3054 CAN interface Motorola MC33388 interface The Philips TJA1050 is a successor to the PCA82C250 transceiver. The TJA1050 has a high Electromagnetic Immunity (EMI) due to a receiver with a wide common-mode range and a significantly lower Electromagnetic Emission (EME) due to optimal matching of the CANH and CANL output signals.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 11 of 42
4.1
There exist several possibilities when using the differential wiring, the obvious ones being to use: complete standard ISO11898 commercial transcievers use Space standard RS-485 transcievers, but connected is a special way, i.e. with TX controlling the Enable function etc, as indicated in Figure 3.1.
VCC
D3 1N5822 R1 4.75k C1 U1 C11 TBJD685K050C VCC GND 1 1 2
TX
3
0.1uF R5 47.5k
EN1 EN2
GND GND 6 7
CANLA R7 150R CANHA
2 4
D1 1N4148UR-1 RX R2 4.75k
GND 1
The choice between the two should be selected on project basis, and as long as the commercial devices are not susceptible to Single Event Latchup, we see no reason why not using standard commercial devices as these are better in terms of EMC
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 12 of 42
and are tougher in terms of overvoltage, apart from being used in multibillion dollar applications. As an exemple, Philips has introduced a transciever based on SOI, which should make it excellent in radiation terms.
As can be seen in Figure 5, the use of standard transcievers simplifies the electronics quite a bit, but it is also possible to meet IOS11898 with discrete solutions, as indicated in Figure 6.
The ISO 11898 standard states that the network terminating resistors used shall have a nominal value of 120 , a maximum value of 130 , and a minimum value of 118 . This termination prevents reflected waves on the bus lines and helps drive the differential voltage to ~0V during the transmission of recessive bits on the bus. Every node should be fitted with filtering resistors on the input to prevent reflected waves being superimposed on the signal due to the internal impedance of the bus transmission lines and the stub connecting the node to the bus wires.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 13 of 42
The use of standard devices is still recommended, if formal ESA space standards are put aside. Some devices are listed in Table 1.
It is possible that with a large number of devices on a CAN bus, buffering would be needed and one solution with a repeater is shown in Figure 7.
A repeater is a device that electrically buffers two bus sections from each other, allowing more CAN nodes to be connected to one bus. On the other hand, the use of repeaters limits the maximum baudrate and/or the maximum physical bus length, since they introduce a delay between each bus section. However, it is strongly advised by Omnisys to look again at the complete system design again, it seems likely this should be solved on a higher level.
4.2
Galvanic Isolation
Galvanic isolation of electrical units in a system is used to eliminate internal ground loops, increase the noise immunity of a system, reduce effects of electrical noise, and protect equipment and user should the unit or something in its surroundings malfunction. Isolation is also an important way to prevent static and most kinds of damaging surges in data communications systems.
4.2.1 No Isolation
With no isolation the units on the CAN network is directly connected to the busline, and it is difficult to prevent the propagation of ground loops, transients, and other disturbances in the system. The resulting reduction in components when opting for a non-isolated system in a satellite with a large network reduces overall spacecraft cost and mass in a not insignificant way.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 14 of 42
The non isolated interface is also the most widely used, which is a very strong argument, both for access to robust devices, but also for design, testing and verification.
Fibre optics are inherently immune to electrical and magnetic interference, and the integration cost due to interference and cross-talk discovered during integration is virtually zero. Additional benefits of using fiber optics are weight reduction due to decreased electrical wire usage, and increased data rate capabilities. To this date it is however unknown how radiation will affect the optic fibre (darkening etc) when exposed to radiation during a deep-space mission. Due to the relatively recent development of fibre applications suitable for space there is a limited number of mature procedures and flight-qualified components available.
In Figure 9, a design is indicated with optocouplers on all nodes in the system. This would add a not insignificant number of devices with potential radiation problems, from SEU, SEL to total dose effects.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 15 of 42
In figure 10, the solution with a few opto couplers is indicated, which is prefered and should only be used when absolutly necessary. If the opto coupler missbehaves, it is possible to shut the complete device down and still operate the bus, i.e. the need for the isolation is device specific and device solved.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 16 of 42
4.3
Electrical Topology
The number of nodes that can be connected to a Controller Area Network is in theory only limited by the number of unique identifiers available (normal CAN offers 2032 identifiers, full CAN offers 536870912). In practice however the driving capabilities of the transceiver circuits limits this. A normal number of nodes that can be attached to a single bus is between 32 and 64. Philips PCA82C250/251 allows for at least 110 nodes to be connected to a single CAN network.
Figure 11. Block level drawing of a hub based realization of the CAN bus
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 17 of 42
5
5.1
R12 4.7
D3 BAS40
R3 1k
R7 27k
R8 27k
10n
R18 150k
R20 2.7k
R22 2.7k
R17 470k
Design details: The delay in/out of the drivers is determined by the selection of both the bipolar transistors and by the comparators used in the receivers. The selected devices in the schematics shown in Figure 12 allows for 1 Mbit/s transmission and more than 30 nodes. Transmitter delay: Receiver delay: Output impedance: Input impedance: Transmitter level: Receiver threshold: 30ns 70ns 10ohm 56kohm 2.7V 400mV 120mV
Because of the nature of the CAN bus, where dominant bits are driven actively and recessive bits are not, the recessive bits will always cause more transmitter/receiver delays than dominant bits. The figures given for the transceiver delays are worst case, i.e. those for dominant to recessive transition. One of the most critical aspects in most systems, if based on two redundant buses, would be to protect the system from different types of power supply faults, triggered by the tranciever. In figure 13, a safe tranceiver supply is shown, protected against overvoltage and with current limitation.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 18 of 42
Q1 2N2907ACSM
Q2 2N2907ACSM
D3 1N5314
Vcc
8.00
UREC in Volts
14.0
8.00 2
UREC in Volts
14.0
8.00 2
UREC in Volts
14.0
4.00
10.00
4.00 3
10.00
4.00 3
10.00 3 6.00 4
6.00 4
6.00 4
-4.00
2.00
-4.00
2.00
-4.00
2.00
500N
1.50U
2.50U
3.50U
4.50U
500N
1.50U
2.50U
3.50U
4.50U
500N
1.50U
2.50U
3.50U
4.50U
Figure 14. Simulation with the three different set-ups: 5 m cable with nominal termination, 30 m cable with nominal termination and cable with 80 ohm instead of 120 ohm termination. The top trace show driver input signal, the second the driver output signal, the third the receiver input signal, and the last, the receiver output signal.
The result is shown in Figure 14, that indicates that even with 30 m cable length and with "faulty" termination, the signal transmission is perfect after reception.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 19 of 42
10.00
18.0
10.00
18.0
10.00
18.0
14.0
1 0
14.0
1 0
14.0
UREC in Volts
UREC in Volts
-10.00
10.00
-10.00
10.00
UREC in Volts
-10.00
10.00
-20.0
6.00
-20.0
6.00
-20.0
6.00
-30.0
2.00
2.00
Figure 15. Noise with 100kHz, 1 MHz and 10 MHz, 10 V rms applied. No sensitivity can be seen.
The simulation show no design problems with RF susceptibility, but this should be complemented with real world testing for a particular implementation, is it is dependent to a large degree on cable and connector imperfections.
5.1.4 Availability
There should be no problems in availability of suitable components, all components used are available in MIL883 and or QMLQ and most could be attained with at QMLV, if so required. However, with lead time etc., it could be smart to change type on the diodes and transistors, there exist similar devices that could be easier to optain for a specific project.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 20 of 42
5.2
A transformer isolated CAN tranceiver design is shown in Figure 17, and TX0/TX1 is used as indicated in Figure 16. It is based on discrete components and there should be no availability problems. The design is based on a standard transformer design, but this must be ordered as a custom product, depending on both real physical aspects, such as size, and on formal quality aspects used for a particular project. Transmitter delay: Receiver delay: Output impedance: Input impedance: Transmitter level: Receiver threshold: 60ns 70ns 10ohm 35kohm +/- 4V 3.15V 20mV
Because of the nature of the CAN bus, where dominant bits are driven actively and recessive bits are not, the recessive bits will always cause more transmitter/receiver delays than dominant bits. The figures given for the transceiver delays are worst case, i.e. those for dominant to recessive transition.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 21 of 42
C3 100n
Vcc
L1 TRANS_1_2
R8 1k Tx0
One of the most critical aspects in most systems, if based on two redundant buses, would be to protect the system from different types of power supply faults, triggered by the tranciever. In figure 18, a safe tranceiver supply is shown, protected against overvoltage and with current limitation.
U12V R1 47k R5 27k R6 1k R7 30
Q1 2N2907ACSM
Q2 2N2907ACSM
D3 1N5314
Vcc
R4 10k GND
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 22 of 42
1 30.0 42.0
20.0
CANTERM in Volts URXT in Volts
32.0 2
CANTERM in Volts
20.0
URXT in Volts
32.0 2
CANTERM in Volts
20.0
URXT in Volts
32.0 2
10.00
22.0 3
10.00
22.0 3
10.00
22.0 3
12.0 4 5
12.0 4 5
12.0 4 5
-10.00
2.00
-10.00
2.00
-10.00
2.00
1.00U
3.00U
5.00U
7.00U
9.00U
1.00U
3.00U
5.00U
7.00U
9.00U
1.00U
3.00U
5.00U
7.00U
9.00U
Figure 19. Simulation with three different set-ups: 5 m cable with nominal termination, 30 m cable with nominal termination and cable with 80 ohm instead of 120 ohm termination. The top two traces show driver input signals, the third the driver output signal, the fourth, the receiver input signal and the last, the detected signal.
The result is shown in Figure 19, that indicates that even with 30 m cable length and with "faulty" termination, the signal transmission is perfect after reception.
4.00
2.00
2.00
4.00
2.00
4.00
URXT in Volts
URXT in Volts
2.00
2.00
URXT in Volts
URX in Volts
URX in Volts
URX in Volts
2.00
-2.00
-2.00
-2.00
-2.00
-4.00
-4.00
-2.00
-4.00
-2.00
10.00U
30.0U
50.0U
70.0U
90.0U
1.00U
3.00U
5.00U
7.00U
9.00U
1.10U
1.30U
1.50U
1.70U
1.90U
Figure 20. Noise with 100kHz, 1 MHz and 10 MHz, 10 V rms applied. No sensitivity can be seen.
The simulation show no design problems with RF susceptibility, but this should be complemented with real world testing for a particular implementation, is it is dependent to a large degree on cable and connector imperfections.
5.2.3 Availability
For the transformer, this must be ordered separately from a qualified source, but for the other components, there should be no problems in availability, all components used are available in MIL883 and or QMLQ and most could be attained with at QMLV, if so required. However, with lead time etc., it could be smart to change type on the diodes and transistors, there exist similar devices that could be easier to optain for a specific project
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 23 of 42
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 24 of 42
5.3
These four factors lead to numerous advantages. Firstly, by decreasing the resistance Rds (on) by up to 20 %, A-BCD1 generates far less heat than equivalent bulk silicon processes, reducing or eliminating the need for heat sinks and keeping costs down. And with this low Rds (on) , SOI gives DMOS transistors excellent power handling capabilities, allowing designers to choose for the same size of chip of lower heat dissipation, or higher current handling ability, or a smaller chip with the same dissipation. The end result of this choise has, in one example, allowed stand-by power consumption to be reduced. Secondly, with no junctions between the n- and p-type devices and the substrate, SOI is intrinsically free from latch-up (associated with the overloading of bulk silicon transistors) and virtually eliminates problems arising from cross-talk via the substrate, load dump and other accidental high external voltages. These features make SOI inherently reliable and also allow for easy integration of multiple power devices, bridge rectifiers and flyback diodes on the same piece of silicon. When combined with CMOS, Bipolar, JFET and DMOS SOI devices, these advantages enable the creation of real smart power circuits.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 25 of 42
Thirdly, the packing densities achieved by SOI represent a major advance on bulk silicon, enabling size reductions of up to 30 %: next generation A-BCD2 (EZ-HV) devices should be able to improve on this still further, as well as offering even lower Rds (on). Lastly, many of SOI's advantages come from the isolation of the components in the oxide layer, ensuring excellent insulation and, as a result, a significant reduction in parasitic capacitances, leading to quicker and easier design-in. Eliminating latch-up and parasitics provides other benefits as well, such as protection from voltage spikes. Because A-BCD1 does not need reverse-biased junctions to isolate components, leakage currents are avoided and so SOI has much greater heat tolerance, up to 160 C instead of the normal 125 C for bulk silicon. Comments from Omnisys: The use of SOI technology should remove two of the radiation problems of integrated circuits, the Total Dose degradation and the Single Event Latch-up, but this should be verified by either analyses or test. Furthermore, the probability of other types of single event faults should be reduced. As Philips is likely not to share information about their process, testing will be needed in that case.
5.3.2 Availability
Volume production is now running. Philips have some 80 % of the market share for CAN bus tranceivers (Anders Lundquist, Mecel, verbal communication), and while most systems now are based on "older" devices most new projects are designed around the TJA1050. The TJA1050 is available in plastic SO-8 in commercial and automotiv grade, and as tested but not screened naked die. Minimum, quoted volume, is 1000 devices for packaged devices and 2000 for naked die (Promax). The packaged devices should be available with smaller miminum volume shortly through other distributors. The cost is about 1 ECU per device, naked as well as packaged.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 26 of 42
used by nodes that don't need to send any data, but can also be used to prevent network communication from being blocked etc,
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 27 of 42
1. The loop delays of the connected bus nodes (CAN controller, transceiver etc.) and the delay of the bus line 2. The relative oscillator tolerance between nodes 3. The signal amplitude drop due to the series resistance of the bus cable and the input resistance of bus nodes. Effects 1 and 2 determine a value for the maximum bus line length with respect to the CAN bit timing. Effect 3, on the other side, determines a value with respect to the output signal drop along the bus line. The minimum of the two values has to be taken as the actual maximum allowable bus line length. As the signal drop is only significant for very long lengths, effect 3 can often be neglected for high data rates. A data rate of 500 kbit/s gives a maximum length of 107 m. The calculation is based on effects 1 and 2 assuming an oscillator tolerance of better than 0.15%. Notice that the stated values apply only for a well terminated linear topology. Bad signal quality because of improper termination can lower the maximum allowable bus length. For most satellite applications, 1 Mbit data rate should not be any problem, as the most often better crystals are used and 30 m cabling would be considered very long.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 28 of 42
5.4
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 29 of 42
System Redundancy
One important aspect is different redundancy concepts for the bus hardware. We discuss two concepts; warm and cold redundancy setup operates, both with majority voting. One alternative is described each for the cold and warm redundancy, several options exist in both cases, but this can be useful as examples.
6.1
Cold Redundancy
The basic system idea: Single CAN core in each node
Dual transceivers in each node Watchdog in each node, possible to override from local CPU Watchdog is reset through special CAN message When the watchdog times out, the CAN node toggles to the other transceiver. Watchdog times out after one time unit
PWR:A PWR:B PWR:C TC:A TC:B TM:A TM:B
CPU:A
CPU:B
CPU:C
Start-up Sequence
Possible TC on The PWR units power up CPU:A Waits for CPU:A OK, 10 time units CPU:A tries bus A 5 time units, bus B 5 time units The PWR units shuts down CPU:A, powers up CPU:B CPU:B tries bus A 5 time units, bus B 5 time units Cycle can continue, i.e. A-B-C, send TC, A-B-C etc. If CPU:X starts, it transmits Watchdog Reset signal on the established bus CPU:X sends PWR ON commands to PWR:A,B,C for the different subsystems
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 30 of 42
When the Master CPU has turned on the sub-systems, it checks the operation of each device, for example demanding I am OK for each within a certain period of time. If any device does not respond, the Master CPU can decide to try establishing contact on the redundant bus. After the initial check the Master CPU can relinquish bus monitoring to the individual nodes by transmitting a periodical Watchdog Reset signal to every node on the established bus. The bus communication can fail due to either a physical problem (such as a break, or by a bus wire short to either ground or supply voltage), however a bus switch might also be recommended if failures of for example the CAN transceivers located in one of the nodes occur. Bus Failure 1 1. CPU:X stops transmitting Watchdog Reset signal to PWR:ABC on BUS:X due to failure of CAN transceiver or CAN controller. 2. The system goes to start-up again (see Start-up Sequence). Bus Failure 2 1. BUS:X is broken between CPU:X and PWR:ABC 2. The Watchdog Reset signal is lost and the system goes to start-up again. Communication will be established on non-broken bus Bus Failure 3 1. BUS:X is broken between CPU:X and TC:A 2. Watchdog Reset signal is lost and TC:A starts toggling between buses, TC:B can possibly still be used 3. Master CPU is monitoring BUS:Y and detects traffic. 4. Master CPU transfers TC:A back to BUS:X to make sure the transfer not was provoked by a transient problem, either by not transmitting a Watchdog Reset signal on BUS:Y or by transmitting a forcing CAN message. 5. If Master CPU decides to switch bus, the Watchdog Reset signal on BUS:X is terminated and transferred to BUS:Y. 6. All sub-systems will establish contact with BUS:Y. Bus Failure 4 1. 2. 3. 4. BUS:X is broken between CPU:X and Node:X. The Watchdog Reset signal is lost and Node:X toggles to BUS:Y Master CPU detects traffic on BUS:Y Master CPU transfers Node:X back to BUS:X to make sure the transfer not was provoked by a transient problem. 5. If Master CPU decides to switch bus, the Watchdog Reset signal on BUS:X is terminated and transferred to BUS:Y. 6. All sub-systems will establish contact with BUS:Y Bus Failure 5 1. Node:X stops receiving Watchdog Reset signal on BUS:X, due to CAN transceiver or CAN controller failure. 2. Node:X toggles to BUS:Y 3. Master CPU detects traffic on BUS:Y 4. Master CPU transfers Node:X back to BUS:X to make sure the transfer not was provoked by a transient problem, either by not transmitting a Watchdog Reset signal on BUS:Y or by transmitting a forcing CAN message. 5. If Master CPU decides to switch bus, the Watchdog Reset signal on BUS:X is terminated and transferred to BUS:Y. 6. All sub-systems will establish contact with BUS:Y
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 31 of 42
Bus Failure 6 1. BUS:X is shorted to ground or supply voltage 2. All users on the bus will detect form errors, bit stuffing violations, and errors in the reception of self-transmitted messages. This will prompt them to transfer communications to BUS:Y. 3. The Master CPU transfers the users back to BUS:X to make sure the transfer was not provoked by a transient problem. The users toggles back to BUS:Y. 4. The Master CPU terminates the Watchdog Reset signal on BUS:X and start transmitting it on BUS:Y.
6.2
Warm Redundancy
Instead of letting the Master CPU handle the monitoring of the bus and the signalling of bus transfer (Watchdog Reset signal) the system could be designed to function with two physical buses working in parallel. The nodes send a periodic Im OK signal on the bus.
Dual transceivers in each node Receiver stage AND gate Master CPU watchdog that is reset every time an Im OK signal is received. When the Master CPU Watchdog times out the system is re-started Watchdog in each node, possible to override from local CPU Watchdog is reset at every signal level transition When the watchdog times out, the CAN node toggles to the other transceiver. Watchdog times out after T = (Tstuff +1) + 2xTerr.
The maximum dominant time allowed in a the data field is limited by a stuffing rule that states that after 5 consecutive bits of the same polarity a bit of opposite polarity must be inserted in the bit stream. This time (5*bit time) is Tstuff. When detecting a violated stuffing rule the CAN controller starts to issue error flags in the bit time following Tstuff +1. These error flags can last up to 2xTerr . Monitoring of a stuck-to-dominant fault is thus easily achieved using a watchdog timer that
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 32 of 42
is set to Tsd = (Tstuff+1) + 2xTerr. (Tsd being the time a stuck-at-dominant failure is detected) The system goes through the same start-up sequence as a cold redundant system. Bus Failure 1 1. BUS:X is broken between CPU:X and PWR:ABC 2. The receiver from BUS:X supplies one of the inputs of the AND gate on the receive stage with an idle signal, i.e. a logic HIGH. This has no affect on the signal transmitted to the PWR:ABC CAN controller, as the bus signal from BUS:Y then controls the gate. Bus Failure 2 1. BUS:X is broken between CPU:X and TC:A 2. TC:A continues transmission using BUS:Y. The broken BUS:X supplies one of the inputs of the AND gate on the receive stage with an idle signal, i.e. a logic HIGH. This has no affect on the signal received by TC:A, as the bus signal from BUS:Y then controls the gate. Bus Failure 3 1. BUS:X is broken between CPU:X and Node:X. 2. Node:X continues transmission using BUS:Y. The broken BUS:X supplies one of the inputs of the AND gate on the receive stage with an idle signal, i.e. a logic HIGH. This has no affect on the signal received by Node:X, as the bus signal from BUS:Y then controls the gate. Bus Failure 4 1. BUS:X is shorted to ground. 2. The transmission of data and commands are continued via BUS:Y. 3. The nodes receivers see a constant logic LOW, i.e. a dominant bit on the bus, and after Tsd the Watchdogs time out. 4. The input to the receivers AND gates is toggled to a logic HIGH, to remove the influence of the faulty bus on the individual receiver stages, and the system is controlled by the contents of BUS:Y. Bus Failure 5 1. 2. 3. 4. BUS:X is shorted to supply voltage. The transmission of data and commands are continued via BUS:Y. The receivers see a constant logic HIGH, i.e. a recessive bit on the bus. The AND gates on the receiver stages are unaffected by this and the contents of BUS:Y is coming through to the CAN controllers.
Bus Failure 6 1. The AND gate on CPU:A is damaged. 2. CPU:A loses the Im OK signal from the nodes. 3. The Master CPU watchdog times out and the system goes to start-up again
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 33 of 42
Bus Failure 7 1. The AND gate on Node:X is broken. 2. The CAN controller in Node:X takes the Node into off-bus state, the bus is unaffected. Bus Failure 8 1. The AND gate in TC:A is damaged. 2. The CAN controller in TC:A takes TC:A off-bus. 3. TC:B is still functioning and the operation of the bus is unaffected. Bus failure 9 1. The AND gate in PWR:A is broken. 2. The CAN controller in PWR:A takes it off-bus. 3. PWR:BC are still functioning and the operation of the bus is unaffected.
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 34 of 42
7
7.1
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 35 of 42
7.2
7.2.1 Identifiers
The CAN-bus is a multimaster bus with deterministic behaviour with message priority based on a message identifier. The use of this identifier controls much of the behaviour of a distributed control system based on the CAN-bus. In CAN 2.0B, the identifier is 29-bits long and below is a suggestion how to use these address bits. Using the two most significant bits, we divide the messages into four types:
Supervisory Mode Command Mode - high priority Command Mode - standard Data transfer
Bit 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 Supervisory 1 1 Priority Priority Priority Priority Priority Priority Priority Priority Adress Adress Adress Adress Adress Adress Adress Adress Command Command Command Command Command Command Command Command Command Command Command Command 1 0 Priority Priority Priority Priority Priority Priority Priority Priority Adress Adress Adress Adress Adress Adress Adress Adress Command Command Command Command Command Command Command Command Command Command Command Command 0 1 Priority Priority Priority Priority Priority Priority Priority Priority Adress Adress Adress Adress Adress Adress Adress Adress Command Command Command Command Command Command Command Command Command Command Command Data 0 0 data type data type data type record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID record ID
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 36 of 42
7.2.2 Modes
Supervisory mode The supervisor mode is only used at extraordinary events such as system startup, subsystem fault debugging, system reconfiguration etc. It could also be used to override the system from ground via the telecommand/telemetry system. Examples are: forced reboots hard interrupts setting of hard security levels, max current etc. remote code loading remote boot with code download direct monitoring Command Mode High Priority High priority commands involves essential systems and operations, for example real-time commands between sensors and actuators on a three-axis stabilized satellite. Normal Priority Normal priority commands involve non-time critical operations, such as deployment of solar cell arrays, opening/shutting blinds, battery charging. Data mode In Data Mode, users get assigned a record ID and a record length. One or more storage devices record the transmitted data along with the record ID when ordered. When the record is full, the users get a new record ID. The transmit-onaddress scheme allows for transparent use of redundant data storage devices. Storage Record Format: (24-bit identifier + 5-bit data type code) * 8-byte records Data type code #000 indicates long record and the next 32-bit word gives the record length N x Bytes data Maximum of 16 million records Maximum record length: 4 GByte Maximum number of minimum records in 512 MByte: 4.5 M records with only 66 % efficiency. 64 Byte records give 94 % efficiency. Full data rate on CAN is about 8000 msgs/second: fills up 512 MByte in 1.5 h. Example with one Storage Device, One Payload Master and several instruments. 1. Payload Master to Storage Device: Record On Record ID: #00FF01 and length: #8F (128 x 8-byte) 2. Payload Master to Instrument One: Take 128 samples of 4x16 bits with 10 seconds interval Use Record ID: #00FF01 and length: #8F Start Now 3. Payload Master to Storage Device: Record On Record ID: #00FF06 and length: #00, #0000FFFF (64 kByte) 4. Payload Master to Instrument Two: Take picture Use Record ID and length: #00FF06, #00, #0000FFFF (64 kByte) Snap Now
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 37 of 42
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 38 of 42
8
8.1
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 39 of 42
Earth Sens
Gyro CNT
Star tracker
Earth Sensor
C AN
TM/TC
S-band Rx S-band Tx
Auxillary I/O
1 hot+ 1 cold
Passive CAN
Experiment A
Experiment B
Experiment C
Experiment D
PPC603e 32-bit processor (200 MIPS, 50 MFLOPS) 64/70 Mbyte DRAM main memory EDAC Radiation tolerant Boot PROM 512 Mbyte EDAC DRAM in banks possibly 1-8 Mbyte EEPROM (depends on radiation tolerance) CAN-bus interface (dual redundant)
This subsystem could be configured to run both as Main controller and as Payload controller.
32-bit microcontroller (StrongArm, Leon-1) EDAC DRAM/SRAM CAN-bus interface (dual redundant) with remote boot Configured to run small operating system
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 40 of 42
8.3
Essential Communication
20 Hz update data readout transaction with 8 data words each (0.16 ms) control transactions with 8 data words each spare transactions 100 us set-up / latency 10 transactions of 10 16-bit words at 1 Mbit/s = 10*(10*16 +100)*1e-6 = 2.6 ms 20 Hz is 50 ms period, 2.6 ms/50 ms = 5.2 % of available bandwidth
8.4
Payload Communication
The payload communication is handled through redundant CAN-busses, each operating at 1 Mbit/s. The CAN system can handle up to about 8000 msgs/s, and about 400 kbit/s in data bandwidth. This would probably be more than the downlink can handle. Local buffering should be used to even out burst acquisitions.
8.5
System priority
There must be a structured functional architecture defined in the system with assigned priorities and levels of command. In Figure 25, a general division is made between different levels. This must of course be refined and modified depending on mission, modes etc.
The system is based on an autonomous power system that always maintains a regulated main bus voltage according to standards suggested by ESA. To switch different redundant sub-systems on and off, some form of intelligence is required, in our case a Lizard Brain function. It could consist of voting simple CPUs with compact software but other possibilities exist. The main task is to boot up and monitor the functions of the System Controller and the TM/TC subsystems. It will also function as the interface to the power system. When one TM/TC system and one System CNT are operational, they start by loading "Basic Services" functions,
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 41 of 42
such as direct "terminal mode" from ground for the TM/TC system and applying basic attitude control for the System CNT. This could be defined as a safe mode type of operation. These services should be implented as high priority processes/drivers. When all lower levels functions correctly, System applications can operate. These provide more intelligent control, data handling, on-board decision making etc. On top of the System applications, payload control payload applications run.
8.6
Start-up
1) Power system (OK) 2) Lizard Brain (16-bit CPU, Triple majority voting CPU or triple majority voting watchdog with cold redundant CPU) 3) Lizard Brain controls Main controller (Polling boot; Main, Payload, Redundant) (3.b starting telemetry safe mode, ACDC safe mode) 4) Main controller monitors and controls essential subsystems 5) Essential subsystems, ACDC, thrusters, . 6) Main controller starts (controls) payload controller 7) Payload controller controls payload 8) Payload Boot procedure: 1) Power bus voltage established (all subsystems off) 2) Lizard brain failsafe boot (Tripple majority voting CPU or Tripple majority voting watchdog with cold redundant CPU) 3) Lizard brain boots (power on) on Main controller [Main, Payload, Redundant] 4) Power on: wait 1 sec, if no answere on 1553, go to next Controller 5) If all three fail, start Telemetry system in failsafe mode as 1553 Bus controller 6) Main controller starts essential subsystems 7) Possible polling start of redundant subsystems 8) Main controller starts Payload controller 9) Main controller enables Payload instruments
URL: www.omnisys.se Tel. +46 31 7096970 Fax. +46 31 7096979 Date: 11/07/01 Pages: 42 of 42
References
Ref 1 ISO 11898:1993, Road vehicles Int erchange of digital information Controller area network (CAN) for high-speed communication, Nov 1993 Ref 2 Ref 3 Statement of Work, ESTEC, April 1999 Packet Telemetry, CCSDS 102.0-B-4 Blue Book, Nov 1995 ftp://ftp.estec.esa.nl/pub/ws/wsd/CAN/canspace.htm http://www.canhug.org/ http://www.kvaser.org/ http://www.can-cia.de/ http://www.odva.org/ http://content.honeywell.com/sensing/prodinfo/sds/ Smart Distributed System (Honeywell). http://www.bosch.de/de_e/productworld/k/ products/prod/can/Bosch http://141.44.61.248/NT/CAN/Welcome.html IPE CAN Home Page, University of Magdeburg. http://www.docs.uu.se/~ken/ http://www-us.semiconductors.philips.com/can/ http://www.mfuniversity.com/siemens/homepage.htm http://www.algonet.se/~staffann/developer/CAN.htm http://www.can-cia.de/pc.htm http://www.can-cia.de/pth.htm
CAN for space CAN HUG CAN Kingdom CiA ODVA SDS Bosch IPE CAN Ken Tindell Philips Siemens Institute Staffan's CAN page CAN controllers CAN tranceivers
10