Advanced PostgreSQL in Django

  • by Christophe Pettus

PostgreSQL is the most advanced open-source database on the planet, but (except for GeoDjango) few Django developers take advantage of its more advanced features. We want to change that!

We’ll talk about custom types (including Admin integration), replication tricks, specialized indexes, unstructured types such as JSON, stored procedures, and other ways to get maximum functionality and performance out of your PostgreSQL data store.

Database Agnosticism

  • Django has to be able to handle all databases.
  • That means it loses special features of individual database systems (PGSQL, MySQL, etc).
  • Database migration is a once in a lifetime thing.

What can PostgreSQL do for you

  • PostgreSQL has a huge range of built-in types
  • Most can be used natively in Django
  • You can find libraries to support them.

citext

  • case insensitive text
  • ignores case on comparisons for a field

hstore

  • built-in dict-like structure
  • Maps to Python dict
  • Can be indexed and queried for inclusion
  • requires custom sql

json

  • All the cool kids use NoSQL databases
  • Something faster than MongoDB to store your JSON databases
  • validated going in
  • Not feature rich, but it’s growing

Others

  • UUID
  • IPv4 and IPv6
  • Interval - stores time intervals directly in the database
  • See Craig Kersteins’s talk later

Where do we sign up?

Getting it work in Django

  • Adapt to psycopg2

    • take string representation of the type and convert to and from the python object.
  • Write a Field class for Django

    • subclass django.forms.Field
  • Create a widget

TODO - ask Christophe for the slides so I can finish this section

Indexes

Django models are great but it’s index creation syntax is somewhat lacking. Multi-column indexes for example

Partial indexes in Django are good:

CREATE INDEX active_orders ON cart_order(status) IF status == 1

Custom Constraints

  • Django’s database agnosticism is a problem for things like foreign-key handling
  • It’s also boring

Tips:

  • Push database constraints into the database whenever possible.

Raw SQL

The three-join rule:

  • If you are joining more than three tables, use raw SQL
  • It turns out that PostgreSQL is really good at multiple joins
  • Don’t fall back into iterating over querysets to get data. Use SQL!

Tips:

  • Put RAW SQL in the model or manager. Put it in the view and you risk losing it
  • Use South
  • Don’t be constrained by fear of migrating from one database to another. Choose and use special features