Sei sulla pagina 1di 4

Introduction to Distributed Data

Processing (DDP)
• Movement and structure of data around
organisations
G53DBC
• Range of data processing approaches
– Classified according to degree of
Distributed Data Processing ‘Centralisation’
– Currently, most businesses employ hybrid
approaches
– Client/Server architectures

Centralized Data Processing Distributed Data Processing


(CDP) (DDP)
• Centralised computers, processing, data, • Computers are dispersed throughout
control, support organisation
• What are the advantages? • Allows greater flexibility in structure
– Economy for equipment and personnel • More redundancy
– Lack of duplication • More autonomy
– Ease in enforcing standards, security

Why is DDP Increasing? DDP Pros & Cons


• Dramatically reduced hardware costs • There are no complete solutions
• Increased desktop power
• Improved user interfaces (!) • Key issues
• Ability to share data across multiple servers – How does it affect end-users?
– How does it affect management?
– How does it affect productivity?

1
Benefits of DDP (1) Benefits of DDP (2)
• Responsiveness • End-user Productivity
• Availability • Distance & location independence
• Organisational Patterns • Privacy and security
• Resource Sharing • Vendor independence
• Incremental Growth
• Increased User Involvement & Control

Drawbacks of DDP Reasons for DDP


• Difficulties in failure diagnosis • Need for new applications
• More components and dependence on – On large centralised systems, development can take
communication means more points of failure years
• Incompatibility of components – On small distributed systems, development can be
• Incompatibility of data component-based and very fast
• More complex management & control • Need for short response time
• Difficulty controlling information resources – Centralised systems result in contention among users
and processes
• Suboptimal procurement
– Distributed systems provide dedicated resources
• Duplication of effort

Hybrid processing approaches Client/Server Architecture


• Centralized systems (mainframes, etc) • Combines advantages of distributed and
• Distributed systems (PCs) centralized computing
• Networked systems • Cost-effective, achieves economies of scale
• Client-Server computing • Flexible, scalable approach

2
Intranets Internet
• Often use Internet standards • Provides access to outside users
• Content is accessible only to internal users – Customers
• A specialized form of client/server – Vendors / suppliers
architecture
• ‘Extranets’
– extensive, controlled, secure access
– usually mapped to a particular WAN

Distributed applications Keeping data


• Horizontal partitioning • Tailored to size of business
– Different applications on different systems – Small businesses can often rely on a collection
– One application replicated on systems of files (e.g. text and numerical data)
– Example: Office automation – Large businesses will often rely on one or more
databases
• Vertical partitioning – Distributed organisations will often need to
– One application dispersed among systems distribute databases
– Example: Retail chain • I/O
• Store

Distributed databases (1) Distributed databases (2)


• Centralised databases • Replicated databases

– No duplication of data – Less data contention


– Good for security and consistency – High/replicated storage & data update costs

– Contention for access – Three variants of replication


– Response times poor • Real-time (two-phase commit, on entry)
– Breakdowns propagate to local sites • Near Real-time (batch backups, e.g. every 30 mins.)
• Deferred (bulk transfer, once or twice per day)

3
Distributed databases (3) Networking Requirements of DDP
• Partitioned databases • Connectivity requirements
– What links between components are necessary?
– No duplication, limited contention
– Disperses load • Availability requirements
– Graceful degradation? – Percentage of time application or data is
available to users
– Ad hoc reports more difficult to assemble
– Maintaining separation as data evolves difficult • Performance requirements
– Response time requirements

Summary
• CDP and DDP
• Benefits and pitfalls of DDP
• Architectures for hybrid approaches to DP
– Client/Server architecture
• Distributed applications
• Distributed data
• Networking requirements
• Next time: Intranets and Internet

Potrebbero piacerti anche