Distributed Database
What Is A Distributed Database?
A distributed database is one in which data is kept in several physical places. It might be stored on numerous computers in the same physical location or spread across a network of interconnected computers.
Deeper Definition
A decentralized database distributes the burden over numerous workstations and employs advanced algorithms to balance incoming and outgoing queries for the fastest response time. This form of database is beneficial when more data has to be kept in the database than recorded on a single physical system. Database developers must choose between rapid and consistent replies.
In general, distributed databases incorporate the following characteristics:
• Location independence • Distributed query processing • Distributed transaction management • Hardware independence • Operating system independence • Network independence • Transaction transparency
There are several good reasons for splitting up a database:
Demand: When a large number of users attempt to access the data at the same time, database performance degrades. By dividing the burden, more computers can respond to more requests while users experience no performance delays.
Data Size: At writing, the most extensive commodity disk drives available are 18 terabytes. Some data sets are too huge to fit on a single disk, and these data sets must be distributed over numerous hard disks.
Drives can fail, which needs redundancy. If the data is essential, making numerous copies and storing them on other devices safeguards against hardware failure.
Privacy: Some data sets are segmented to maximize confidentiality and reduce risks in the event of a data breach. If separate pieces of the data are kept on different machines, even if one component is compromised, the remainder of the data remains safe.
Speed: Network latency remains an issue when the database and the user executing queries are geographically apart. Because the data does not have to travel as far, placing copies of the data in centers near the user leads to speedier answers. The need for speed is especially significant for initiatives that include personnel from various continents.
Distributed Database Example
Though several distributed databases are available, some examples are Apache Ignite, Apache Cassandra, Apache HBase, Couchbase Server, Amazon SimpleDB, Clusterpoint, and FoundationDB.
« Back to Glossary Index