Table Column Column Hamburgers Item Topping Cheeseburger Tomatoes Mushroomburger Swiss
Could be “simplified” (“normalized”) as:
Table Column Column FoodTypes ID Name 1 Hamburger 2 Topping Food Item TypeID Cheeseburger 1 Mushroomburger 1 Tomatoes 2 Swiss 2
Recently I’ve gone over the deep end with this approach, abstracting and re-abstracting a fairly complex database design until I was left with something both very simple and yet completely un-resembling of the actual data being stored. This has led me to the conclusion that all databases could be “summarized” in a single monstrous table called “Entries” with columns:
ID Type Value1 Value2
For example:
ID Type Value1 Value2 4321 Item 8746 Descrip 4321 Food 5673 Item 9876 Descrip 5673 Hamburger 0341 Item 1234 Descrip 0341 Lettuce 5478 Relation 5673 0341 2381 Descrip 5478 Topping 2244 Relation 5673 4321 2160 Descrip 2244 Class 4436 Relation 0341 4321 7547 Descrip 4436 Class
Here, using these 4 columns in 1 table, I have created two objects sharing a common superclass, given them an attribute, and defined not only a relationship between them but the class of that relationship as well. We could now say “Lettuce is a Topping of Hamburger, both of which are Foods”. There would of course be a set of rules for this system, but that is beyond the scope of this question. My question is, is this not logically the case? If so (or if there is a different, “correct” answer), what is this in relation to real databases? Does such a system exist, or should it not? I’m not sure if I’ve gone far enough in my analysis, and I feel like I’m on the verge of some insight which is profoundly obvious to mathematicians and computer scientists (like “yes, all relational data can be described in terms of binary operands like F[a, b]).
Asked By : SG1
Answered By : András Salamon
Mr. Kempe’s conception depends upon considering the diagram purely in its self-contained relations, the idea of its representing anything being altogether left out of view; while my doctrine depends upon considering how the diagram is to be connected with nature.
In terms of modern database representations, RDF corresponds to Peirce’s ternary relations, while key-value stores correspond to Kempe’s graphs. As Raphael pointed out in a comment, these are essentially the ideas behind graph databases. As an aside, the arguments about binary/ternary/quaternary/quinternary/… tuples being the “best” foundation for data representation still rage on in the W3C semantic web working groups. I believe that the original insight of the relational model was that for efficiency, one should use relations of the right arity; relationships that are naturally represented as tuples should not be transmogrified into more complicated data structures just for reasons of purity. However, this argument also goes the other way: relationships where different instances naturally have different arities should not be shoehorned into a rigid relational schema just to preserve the purity of the data model.
- A. B. Kempe, A Memoir on the Theory of Mathematical Form, Philosophical Transactions of the Royal Society of London 177, 1886, 1–70.
- C. S. Peirce, The Critic of Arguments (1892 essay), in Collected Papers of Charles Sanders Peirce, Vol. III, Harvard University Press, 1933, 250–265.
Best Answer from StackOverflow
Question Source : http://cs.stackexchange.com/questions/12060