A set is a collection of different objects. Some basic operations from the Theory of Sets are the set membership (∈), subset (⊆), intersection (∩), and difference (−). The relational Algebra adapts the set operations to work with relations. However, as we show in this work, the set operations have limitations because of the implicit use of the identity predicate. That is, a tuple is a member of a set if it is identical to any tuple in the set. For example, let's consider two relations. The first one is a list of products that a person wants to buy. The second one is a list of products that one store has. Now, we could get any item from the desired products list and query "can we buy this item in the store?" with the set membership operator (∈), being true if the item is a member of the second set or false if not. With the set membership operator as a basis, we can also perform other queries such as subset, intersection, and difference. The subset (⊆) query would answer to "can I buy all the desired products in the store?". The intersection (∩) would answer to "what products can I buy in the store?" And finally, the difference (−) would answer to "what are the desired products that I cannot buy in the store?". Still, many applications need other comparison predicates that are not limited to identity. For example, if we add quantity and price to the sets of desired products and store's products, comparing the tuples by identity won't have much sense, since a product in the store with stock greater than the required should be valid, and it is also valid a product with a price lower than the user's maximum budget for that product. This MSc work presents the new Relational Conditional Set Operations. The novel operators encapsulate the idea of set operations with conditional queries, facilitating specific operators for them, and allowing their optimization. For example, they are potentially useful in applications of product sales with units and prices, job promotions with skills that have enough experience or certification level, and internships with minimum grades. We validate our proposal's semantics and scalability by studying the first of these applications. Also, we open path for future works such as: to implement the operators in a DBMS; to propose SQL queries able to answer these kind of queries and compare it with our current approach; to extend the idea for bag algebra; to explore a whole new path of optimization for our algorithms; to add support for complex data, allowing similarity comparisons in the predicate; and, to study the use of these operators as basis for other operations that currently use the traditional set operation as basis; among others.
A set is a collection of different objects. Some basic operations from the Theory of Sets are the set membership (∈), subset (⊆), intersection (∩), and difference (−). The relational Algebra adapts the set operations to work with relations. However, as we show in this work, the set operations have limitations because of the implicit use of the identity predicate. That is, a tuple is a member of a set if it is identical to any tuple in the set. For example, let's consider two relations. The first one is a list of products that a person wants to buy. The second one is a list of products that one store has. Now, we could get any item from the desired products list and query "can we buy this item in the store?" with the set membership operator (∈), being true if the item is a member of the second set or false if not. With the set membership operator as a basis, we can also perform other queries such as subset, intersection, and difference. The subset (⊆) query would answer to "can I buy all the desired products in the store?". The intersection (∩) would answer to "what products can I buy in the store?" And finally, the difference (−) would answer to "what are the desired products that I cannot buy in the store?". Still, many applications need other comparison predicates that are not limited to identity. For example, if we add quantity and price to the sets of desired products and store's products, comparing the tuples by identity won't have much sense, since a product in the store with stock greater than the required should be valid, and it is also valid a product with a price lower than the user's maximum budget for that product. This MSc work presents the new Relational Conditional Set Operations. The novel operators encapsulate the idea of set operations with conditional queries, facilitating specific operators for them, and allowing their optimization. For example, they are potentially useful in applications of product sales with units and prices, job promotions with skills that have enough experience or certification level, and internships with minimum grades. We validate our proposal's semantics and scalability by studying the first of these applications. Also, we open path for future works such as: to implement the operators in a DBMS; to propose SQL queries able to answer these kind of queries and compare it with our current approach; to extend the idea for bag algebra; to explore a whole new path of optimization for our algorithms; to add support for complex data, allowing similarity comparisons in the predicate; and, to study the use of these operators as basis for other operations that currently use the traditional set operation as basis; among others.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.