Corruption Detection and Containment

Cite

PGCon - PostgreSQL Conference for Users and Developers, Andrea Ross

PGCon - PostgreSQL Conference for Users and Developers

Formal Metadata

Title

Corruption Detection and Containment

Alternative Title

Survey - Error: invalid page header in block 123 of relation "foo"

Title of Series

PGCon 2013

Number of Parts

Author

PGCon - PostgreSQL Conference for Users and Developers

Contributors

Heroku (Sponsor)

License

CC Attribution - NonCommercial - ShareAlike 3.0 Unported:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal and non-commercial purpose as long as the work is attributed to the author in the manner specified by the author or licensor and the work or content is shared also in adapted form only under the conditions of this

Identifiers

10.5446/19044 (DOI)

Publisher

PGCon - PostgreSQL Conference for Users and Developers, Andrea Ross

Release Date

2013

Language

English

Production Place

Ottawa, Canada

Content Metadata

Subject Area

Computer Science

Genre

Conference/Talk

Abstract

This will not be the most exciting talk, nor is there (currently) a simple answer to make hardware corruption problems go away. But it's important -- without being careful, it's easy for corruption to spread to replicas and backups, leaving data hopelessly lost. Or, a strange crash due to corruption could take many engineering resources to analyze. This talk is about kinds of hardware corruption that can and do happen, and the ways to detect and contain the corruption as quickly as possible. Additionally, we'll discuss a roadmap of improvements to postgresql to make this an easier process; as well as alternatives (such as detecting corruption in the filesystem). Note: Some storage systems do provide strong protections against data corruption. This talk is primarily (though not exclusively) targeted at users of the local filesystem, particularly on Linux. These are the topics that will be addressed: Why not deal with this in the filesystem? The different kinds of corruption. When to detect the corruption, and how to contain it. Data page checksums Backups and corruption Replication and corruption Background and offline detection More work to be done