database greenhorn

PoisonedPrisonPanda@discuss.tchncs.de · edit-2 1 year ago

PoisonedPrisonPanda@discuss.tchncs.de · 1 year ago

broadly you want to locate individual records as quickly as possible by using the most selective criteria

What can be more selective than “if ID = 'XXX'”? Yet doesn't the whole table still have to be scanned until XXX is found?

… and to familiarize yourself with normalization.

Based on a quick review of normalization, I doubt it helps me, as we don't have such links in our data. We “simply” have many products with certain parameters (title, description, etc.); based on those we process each product and store it, together with the additional output, in a table. However, to avoid processing a product twice, we want to drop any product in the processing pipeline that is already stored in the “final” table.
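For illustration, the "skip what is already in the final table" check could look roughly like this. It is only a sketch using Python's built-in `sqlite3`; the table name `final_products`, the column `product_id`, and the sample rows are made up for the example and are not our real schema.

```python
import sqlite3

# In-memory database standing in for the real one (assumption for the example).
conn = sqlite3.connect(":memory:")

# Hypothetical "final" table; declaring product_id as PRIMARY KEY gives it an index.
conn.execute(
    "CREATE TABLE final_products (product_id TEXT PRIMARY KEY, title TEXT, output TEXT)"
)
conn.execute("INSERT INTO final_products VALUES ('A1', 'Widget', 'done')")


def already_processed(conn, product_id):
    """Return True if the product is already stored in the final table."""
    row = conn.execute(
        "SELECT 1 FROM final_products WHERE product_id = ?", (product_id,)
    ).fetchone()
    return row is not None


# Products waiting in the pipeline (made-up sample data).
incoming = [("A1", "Widget"), ("B2", "Gadget")]

# Keep only the products that are not in the final table yet.
to_process = [p for p in incoming if not already_processed(conn, p[0])]
print(to_process)
```

Because the lookup goes through the key column, the database can use the key's index for each check.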

It isn’t just a big bucket to throw data into to retrieve later.

That's probably the biggest insight I have had since we started working with a database.

Anyway, I appreciate your input, so thank you for this.

normalexit@lemmy.world · 1 year ago

If you are searching by a primary key or other indexed id you should be fine. Here are a couple of articles to check out:

https://www.atlassian.com/data/databases/how-does-indexing-work

https://www.red-gate.com/simple-talk/featured/postgresql-indexes-what-they-are-and-how-they-help/

The TL;DR: a WHERE clause that hits an index doesn’t have to go through all the rows in the table.
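If you want to see the difference yourself, here is a small sketch using SQLite through Python's built-in `sqlite3` module (not PostgreSQL, which the second article covers). The table `products`, the column `ext_id`, and the index name are invented for the example. It asks SQLite for its query plan before and after creating an index on the column used in the WHERE clause.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical table with no primary key or index on ext_id at first.
conn.execute("CREATE TABLE products (ext_id TEXT, title TEXT)")
conn.executemany(
    "INSERT INTO products VALUES (?, ?)",
    [(f"id-{i}", f"title {i}") for i in range(1000)],
)


def plan(conn, sql, params=()):
    """Return the human-readable detail column of SQLite's query plan."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[-1] for row in rows]


query = "SELECT * FROM products WHERE ext_id = ?"

# Without an index the plan is a full table scan.
plan_before = plan(conn, query, ("id-500",))

# After indexing ext_id the same query becomes an index search.
conn.execute("CREATE INDEX idx_products_ext_id ON products (ext_id)")
plan_after = plan(conn, query, ("id-500",))

print(plan_before)
print(plan_after)
```

The first plan should start with SCAN and the second with SEARCH, naming the index; the exact wording varies between SQLite versions. PostgreSQL offers the same kind of check with its `EXPLAIN` command.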