
Manipulating text with PostgreSQL - lesser known PG jewels


Formal Metadata

Title
Manipulating text with PostgreSQL - lesser known PG jewels
Title of Series
Number of Parts
351
Author
License
CC Attribution 3.0 Unported:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
Language
Production Year
2022

Content Metadata

Subject Area
Genre
Abstract
PostgreSQL is the most advanced opensource RDBMS. As GIS folks, you most probably use it in combination with PostGIS, its geospatial plugin. When dealing with geospatial data, we usually focus on geometries. But most feature attributes are text data. Of course, filtering on these text data with standard SQL capabilities is a day-to-day operation for database users. But PostgreSQL provides many more capabilities when it comes down to text data management. In this presentation, we will go through a few of them. After a quick look at standard text functions in PostgreSQL, we will discover the lesser known fuzzy matching modules: - `pg_trgm` extension allows for string searches using trigrams to determine a similarity rank between text items - `fuzzystrmatch` extension provides fuzzy matching functions like soundex, Levenshtein, metaphone Then, we will explore *Full Text Search (FTS)* PostgreSQL capabilities. Last but not least, we will peek inside the PostgreSQL collation concept, which has nothing to do with your lunch. Collations are a powerful feature in PostgreSQL allowing you to adapt the way you deal with text data according to the localization. Like trying to answer this - apparently - obvious question: is '12' before or after '2'? And, because we can, display all of this on a map :-)
Keywords
Transcript: English(auto-generated)
So I will talk to you about PostgreSQL and PostGIS, and especially today, PostgreSQL features. Raise your hand if you use PostgreSQL? Whoa, that's a lot. And PostGIS? Ah, good. Okay, so I will focus on more advanced features.
So today I won't exactly talk about GIS, because in GIS we usually deal with features, and features are geometry and attributes. And actually most attributes are textual data. So it's very important to be able to handle this textual data.
So I will focus today on some text search capabilities in PostgreSQL. I will use a data set which is a list of addresses in the Drôme, which is a département of France. And in this data set you have points, and in the attributes you will find the name of the commune, the city, the name of the street, etc., etc.,
which we will use for text search. So I will present you some lesser known PostgreSQL jewels, some hidden features. I start with standard pattern matching, which you probably already know, then fuzzy matching, trigrams for text search, and a bit of collations.
So let's start with standard pattern matching. This is something you probably already use, that's the LIKE keyword in SQL. You also have the equivalent operator, which is two tildes (~~). And it's used for pattern matching, just like in the query you see. You can say select all the communes from our data set,
where the name of the commune, the name of the city, begins with B. And you have a few special characters: underscore to match any single character, and percent to match any sequence of zero or more characters. So that's pretty convenient. You also have the ILIKE variation, which is case insensitive. And you can reverse the search with NOT,
just like any standard SQL expression. Just be careful with indexing: if you use a reverse search, for example any characters and then a B at the end, it won't be indexed by standard indexes, so you would have to use a few different things.
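The pattern matching just described could look like this. The `addresses` table and the `commune` column are hypothetical names standing in for the Drôme data set:

```sql
-- Communes beginning with 'B'; '%' matches any run of characters,
-- '_' matches exactly one character
SELECT DISTINCT commune FROM addresses WHERE commune LIKE 'B%';

-- Case-insensitive variant, and a negated search
SELECT DISTINCT commune FROM addresses WHERE commune ILIKE 'b%';
SELECT DISTINCT commune FROM addresses WHERE commune NOT LIKE 'B%';
```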
There is also the SIMILAR TO keyword, which is less known. It's regex pattern matching, which is a bit specific, because it's the SQL standard definition of regular expressions. So it's not POSIX, it's not compatible with what you are used to using as regular expressions outside of PostgreSQL, but you can use it as well.
So you have a few different characters which you can use to define a regular expression to search for. Then you have the classic regular expression pattern matching, which is the tilde (~) operator. And that's used for pretty advanced regular expression search.
You can use all POSIX capabilities of regular expressions. And the tilde operator returns true if the text matches. So you can just use it in your where clause and it will send you back the rows matching the regular expression. You also have some other operators for case insensitivity, for
not matching, or not matching with case insensitivity. Same limitations as for LIKE: be careful with indexes, because your queries with regular expressions cannot be indexed by default. But we will see that there is a specific index we can use, based on trigrams.
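A sketch of the regex operators, again against the hypothetical `addresses` table:

```sql
-- '~' : POSIX regex match; here, communes whose name ends in 'eux'
SELECT DISTINCT commune FROM addresses WHERE commune ~ 'eux$';

-- '~*' : case-insensitive match; '!~' and '!~*' are the negated forms
SELECT DISTINCT commune FROM addresses WHERE commune ~* '^val';
```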
So let's go to some more advanced PostgreSQL text matching with fuzzy matching. So there is this extension in PostgreSQL which is called fuzzystrmatch. Fuzzystrmatch, it's unpronounceable. It provides you with a few specific functions
that help to find some text in a corpus. First of all, you have the Levenshtein distance. You have the definition of the Levenshtein distance. It's a bit complicated written like that. But that's basically how many characters you have to change to go from one string to another by inserting, deleting or substituting characters.
For example, if we select the Levenshtein distance between Firenze and Firenza, it's just one letter of difference, so the distance is one. Between Firenze and Florence, the distance is three. So you can determine the kind of similarity between two words or two expressions.
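The Firenze examples above, as actual queries:

```sql
CREATE EXTENSION IF NOT EXISTS fuzzystrmatch;

SELECT levenshtein('Firenze', 'Firenza');  -- 1 (one substitution)
SELECT levenshtein('Firenze', 'Florence'); -- 3
```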
There's no indexing possible with this function, so you have to combine it with other matching techniques. Another function for matching is soundex. Soundex is in the same extension, and
it gives you the similarity of English text: it actually converts a text to a four-character code, which you can use to compare different words. There is also the difference function, which gives you the number of code characters which are common between the Soundex codes of two different words.
It's specific to English text. So it works well with English text, and you can use indexing using an expression index, because you can say index using soundex on my column. For example, the soundex for Florence, you can see it's F465,
and the soundex for Firenze is F652. So they are different, but for Firenza, you get the same soundex as for Firenze. So if you compare the soundex codes, you will have the same value for Firenze and Firenza, and you can match both words.
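The Soundex comparison above as queries:

```sql
SELECT soundex('Florence');  -- F465
SELECT soundex('Firenze');   -- F652
SELECT soundex('Firenza');   -- F652, same code as Firenze

-- difference() counts matching Soundex code positions, from 0 to 4
SELECT difference('Firenze', 'Firenza');  -- 4
```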
Metaphone and double metaphone are two other sound similarity functions. It's the same principle as for soundex. It's a different algorithm, and the code it gives back is a bit different, but it tries to sound like the original word.
For example, the dmetaphone code for Alexson is ALKS, and for Alex it's ALKS as well. So you can see that both words have the same dmetaphone code. So you can compare or search in your database by dmetaphone,
and it will send you back the names which sound similar to the one you are looking for, so that's pretty convenient also for fuzzy pattern matching. Trigrams. Trigrams are given in the pg_trgm extension.
And the principle of trigrams is a comparison of text based on a three-letter decomposition of words. It gives you a few functions, namely similarity, word_similarity and strict_word_similarity, which behave a bit differently when comparing. The simple one is similarity, and it takes two strings into account.
And it will give you a similarity float number according to how many trigrams of both strings are in common. So you can see the decomposition into trigrams, for example, for the text 'welcome to Firenze'.
There are a lot of trigrams: '  f', '  t', '  w', ' fi', ' to', etc., etc. These are all the three-letter elements, the three-letter items you can decompose the text into. And then PostgreSQL with this similarity function will compare these trigram sets,
and say, okay, we have x amount of trigrams in common, so we give you a similarity score of 0.48. So this is a very good way to compare text, and to search text for a different variation of a word or of a phrase.
And you can either get the score, or you can also use the operators, a true/false version, especially the percent (%) operator, which is configured to give you true if the score is above a specific threshold, which is configurable.
An example: using trigrams is very good to deal with typos, for example when you look for Bordeaux, which is a town in the western part of France and not exactly in the Drôme. I'm looking for a place in the Drôme where the name of the city,
trigram-wise, looks like Bordeaux, but I actually made a typo. It's not Bordeaux, it's Bourdeaux, and you can see that the result gives back the right name of the commune, which is not exactly the one I was looking for, but pretty similar. So it's very good for fixing user typos.
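A sketch of the typo-tolerant search, with the hypothetical `addresses` table again; pg_trgm also ships index operator classes, shown here for completeness:

```sql
CREATE EXTENSION IF NOT EXISTS pg_trgm;

-- Similarity score between 0 and 1 based on shared trigrams
SELECT similarity('Bordeaux', 'Bourdeaux');

-- '%' is true when the score exceeds pg_trgm.similarity_threshold;
-- this finds 'Bourdeaux' despite the typo
SELECT DISTINCT commune FROM addresses WHERE commune % 'Bordeaux';

-- A GiST trigram index accelerates %, LIKE, ILIKE and regex searches
CREATE INDEX addresses_commune_trgm_idx
    ON addresses USING gist (commune gist_trgm_ops);
```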
Trigrams are a very good thing for that. The good thing is also that you can index trigrams. You can index with GiST or with GIN, which are two different indexing methods in PostgreSQL. For trigrams, GiST is preferred. The only thing is that you have a limitation
on the amount of text you can put in the index, but usually it's enough for most use cases. So you can create the index using GiST and this index operator class, which is gist_trgm_ops,
and it will create you an index on your column based on trigrams. The good thing is, once you have this, you can make a query just like the one I showed you before, and it will be accelerated by the index for trigram matching with the percent operator. But the trigram index will also be able to index your queries
using LIKE, ILIKE and regular expressions. So sometimes when you look for everything beginning with some characters, then with o, u, r, d, e, and some characters after, that cannot be indexed by a standard text index,
but if you create an index on trigrams, it will be used for this kind of search. So even if you don't use the trigram search, trigram indexing can be useful as well. So one other big thing in PostgreSQL is full text search.
Full text search is a big beast in PostgreSQL. It's usually not very well known because it's pretty complex. The idea is to be able to search documents of text for a specific text given in a query. A document can be any text in your database.
Usually you take a lot of different attributes, you group them together, you concatenate them, and it gives you a text corpus which is called a document, and that's what we are going to use to look for any text. It's a core PostgreSQL feature, so it's available everywhere. It's very flexible and customizable.
I will show some use cases but not everything because you can go very deep into customizing full text search. It's useful for all languages, it's not only English, and the principle is that you pre-process your documents to be able to access full text search features and also for efficiency.
The principle is: take documents. You take the documents and you process them into tokens, which are individual elements in your text. Then you convert your tokens to lexemes, which are normalized single items corresponding to tokens, and you use dictionaries for that.
You can customize dictionaries, and dictionaries are used to normalize the words. For example, when I say shop and shopping, for full text search it can be the same word because it's the same lexeme. When you search for shop, it will also find shopping. You can strip stop words as well:
there are a lot of words which don't actually have a very strong meaning, so we strip them. You can replace synonyms with dictionaries, so that if you look for a specific word, it will find the synonyms as well in the text. There are a lot of things you can do
when converting the tokens into lexemes thanks to dictionaries. That's the second step for pre-processing the document. The next step is storing a specific data type for searching and also to allow for ranking. Full text search in PostgreSQL
introduces mainly two different new data types, which are tsvector and tsquery. The tsvector is the document converted to lexemes, and the tsquery is what you want to search in your text. The document can be any generated text, and you can use generated columns,
which are a feature starting from PostgreSQL 12, to store the tsvector data. Because you have to convert your document into a tsvector, and to do that you can use, for example, the to_tsvector function. You can say it's in French, it will use French dictionaries,
and here I will concatenate a certain amount of attributes: the name of the city, of the commune, the name of the former commune, the name of the locations, the areas, the name of the streets, and also the postcode and the code for the commune itself.
And as you can see, I can also set a weight on different attributes, so that different parts of the text will have a different importance. So the communes will have the importance A, which is the most important one. For example, a title in your document would have importance A, the weight A, and then you have B, C and D,
which you can assign to your text so that you can determine which part of the text has the most importance. And here I add a new column to my table, and it's generated automatically. It's stored as well, it's materialized, so that I will be able to create indexes on it,
and it will be fast as well. And as soon as I add data to my Drôme table, it will automatically create the tsvector data. So that's a very convenient feature from recent PostgreSQL, the generated columns. Before that we had to use triggers to populate the column.
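The weighted generated column just described could be sketched like this. The table and column names (`commune`, `voie`, `code_postal`) are illustrative, not the ones from the talk's slides:

```sql
-- Materialized tsvector column; to_tsvector with an explicit 'french'
-- config is immutable, which generated columns require
ALTER TABLE addresses
  ADD COLUMN doc_fts tsvector
  GENERATED ALWAYS AS (
    setweight(to_tsvector('french', coalesce(commune, '')),     'A') ||
    setweight(to_tsvector('french', coalesce(voie, '')),        'B') ||
    setweight(to_tsvector('french', coalesce(code_postal, '')), 'D')
  ) STORED;
```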
So how do you use it? Once you have generated your tsvector data, then what you have to do is create a tsquery and use the double at sign (@@) text matching operator, so that you can query your text.
So here I select everything from my addresses where my generated column, my tsvector, matches my query, and my query is just the name Henri in French language. So I will search for every address which has Henri somewhere in the document,
so somewhere in one of the attributes, which is a street name, name of the commune, etc., and it will give me back the rows. And since the rows have a geometry, I can map them with QGIS. I can do a heat map, for example, or the different kinds of visualization you can have.
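The match query above, assuming the hypothetical `doc_fts` generated column:

```sql
-- All addresses whose document contains the lexeme for 'henri'
SELECT *
FROM addresses
WHERE doc_fts @@ to_tsquery('french', 'henri');
```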
Of course you can use PostGIS in combination. So here I will do a full text search plus a geometry search. I will say, okay, I'm looking for every address which has Henri in its name, but within a 10 km radius around Valence here. So that's standard SQL,
but you can use the power of full text search and the power of PostGIS all together in a very simple query. You have a few functions for full text search: to_tsvector to generate a document, to_tsquery to generate a query, and then you mix them together. And then you have plainto_tsquery, phraseto_tsquery and websearch_to_tsquery,
so I will get back to them. With to_tsquery, the query itself is text data, and you can use a few operators. For example, if I want to search for rue or chemin, street or path, I can use the pipe operator, which is OR.
I can use an AND operator or a NOT operator. I can say I want rue followed by Henri, in this order, and I can group conditions, so that I can have expressions which are pretty complicated in my query, and it will look for that in the text according to the predicates you have. You can also restrict the query
to specific weights you have defined in your document, and you can also do prefix matching: I want every text starting with Henri, for example. To build more queries you have a few other options: plainto_tsquery combines all words of a standard text query with the AND operator,
phraseto_tsquery combines them with the followed-by operator, and websearch_to_tsquery is a different, google-like syntax, where you can use 'or', the minus sign, and quoted and unquoted text in your query, so that it feels like a web search. So it's pretty convenient.
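The tsquery operators and the query-building functions, side by side:

```sql
SELECT to_tsquery('french', 'rue | chemin');    -- OR
SELECT to_tsquery('french', 'rue & henri');     -- AND
SELECT to_tsquery('french', 'rue <-> henri');   -- 'rue' followed by 'henri'
SELECT to_tsquery('french', 'henri:*');         -- prefix matching

SELECT plainto_tsquery('french', 'rue henri');   -- words joined with &
SELECT phraseto_tsquery('french', 'rue henri');  -- words joined with <->
SELECT websearch_to_tsquery('french', '"rue henri" or chemin -impasse');
```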
You can index all that. Of course we use a GIN index, so it's very easy to create an index using GIN on my generated column. And you can see that when you do a full text search query, it will do a bitmap index scan, so it will be fast.
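The GIN index on the hypothetical `doc_fts` column:

```sql
CREATE INDEX addresses_fts_idx ON addresses USING gin (doc_fts);
-- EXPLAIN on a doc_fts @@ ... query should then show a bitmap index scan
```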
There are also ranking features, so you can get a rank from your search, saying which result has the best score. And the score can be computed either by lexeme frequency, or by frequency
and proximity of the matching lexemes, so you have two functions for that. You can configure options for normalization as well, so you can go deeper into that. But an example: if I get a score when I look for Roche, there is a commune named La Roche-de-Glun,
and there are a lot of streets named Roche, but the commune will come first, because we put more weight on the name of the communes than on the name of the streets. So when I rank them, I will have a commune with this Roche keyword ranked
higher than just a street with the Roche keyword. So that's pretty convenient to order your search results. There are a lot of other things you can do with full text search: highlighting results, you can have custom dictionaries, you can use unaccent for special character handling, you can use it in combination with pg_trgm
to fix typos first and then look in your text, and you can even write your custom parsers for text: if you have very specific data you want to parse, you can write a parser in C and integrate that into Postgres. Last but not least, collations. Collation is another very lesser known feature.
It's for text ordering. Text ordering is a pretty complex problem. For example, if you are German, you may know that the order for the phone book is different from the usual alphabetical order. And that's pretty weird, but it's a rule.
And sorting emoji: how do you sort emoji? Do you know that? Emojis are text, so you have to sort emojis in some way. How do you do that? You can do that with PostgreSQL. So what we call collatable data is everything which is text, varchar and char. A collation defines a sort order
and character classifications as well, and you can define the collation granularity per column or per operation. There are two different libraries for collations, two different sets of collations: one which is provided by the system, which is libc, and another one which has been implemented later on, which is ICU,
which provides you with a lot, lot, lot of collation capabilities. And you can have very specific collations with very specific rules. It's pretty complex, but you can do almost whatever you want. For example, I will create a collation on the ICU locale 'und-u-co-emoji', so this is an ICU collation,
and it applies to the locale below, and it says: order my emoji in the right order. And when I want to use a collation, I can just write my query and say order by name, and I will say you order by name, but you use the collation
'de-x-icu'. I don't remember the exact ICU naming, but it says use the standard Deutsch ordering. Some use cases: why do you want to do that? You want to do that for natural sorts, for example. Here you can see that with a numeric collation,
EM001 will be before EM999: that's natural sort, so the numbers are ordered in numeric order. But if you take only an alphabetical order, below, you have EM001 coming before EM1000,
which is okay, but then you have EM999 which comes after EM1000, which is not okay. You can use that as well for diacritic character handling. You don't know it in English, but for German people, for Finland, or even for French,
that's something which is always complicated. For example, the Eszett (ß) in German is very complicated, because sometimes it is equivalent to two s's, sometimes it's ß, and sometimes you have an uppercase ß, which exists but is never used in classic German. So that's very complex stuff,
but you can do that with PostgreSQL. And for case sensitivity as well: when you want to look for text without taking case into account, you can do that, but you would need an ICU collation for that. And last one, you can sort emojis. There's actually a standard,
an international standard, saying in which order emoji have to be sorted, and ICU implements that. So you know that the happy face comes first. And that's all.
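The two collation use cases from the talk could be sketched like this. The collation names and the `assets` table are illustrative; the ICU locale strings themselves are real:

```sql
-- Natural (numeric) sort: digit sequences inside strings compare by
-- numeric value, so 'EM999' sorts before 'EM1000'
CREATE COLLATION natsort (provider = icu, locale = 'en-u-kn-true');
SELECT name FROM assets ORDER BY name COLLATE natsort;

-- Emoji ordering per the Unicode/CLDR emoji collation
CREATE COLLATION emoji (provider = icu, locale = 'und-u-co-emoji');
```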