Linq
Linq
Language Integrated Query
Linq - Basics
Linq allows querying data that are available in various different formats using a unified querying model.
We might query data in one of the following representations:
- Relational data using SQL
- In-Memory Data as object graphs
- XML documents
- …
Linq uses a declarative programming paradigm. You define what you want, not how to get there.
Linq - Basics (2)
A Linq Query consists of 3 steps:
- Define a data source
- Create a query
- Execute the query
Linq - Data Source
Linq heavily build on the IEnumerable and IQueryable interfaces. Whenever a data source implements one of these interfaces, Linq can be used to query the data behind it.
If a datasource doesn’t offer one of these interfaces, the data must be loaded into memory (e.g. into a List, Array, or similar that implements IEnumerable) to be able to query it.
Linq - Data Source (2)
// Step 1 - Define data sourceList<Product> products = new List<Product> { new Product("Coke", categoryId: 0), new Product("Tea", categoryId: 0), new Product("Apple", categoryId: 1)}
Linq - Create a query
Linq offers 2 different ways of creating a query
- Query Expressions
- Method Syntax
Query Expressions
Query expressions look similar to SQL statements. These expressions are built into the C# language and converted by the compiler to method calls.
var query = from product in products join category in categories on product.CategoryId equals category.Id select new { CategoryName = category.CategoryName, Name = product.Name }
Query Expressions (2)
Method syntax
The same query can be defined using regular C# method calls and lambda expressions.
var query = products .Join( categories, p => p.CategoryId, c => c.Id, (p, c) => new { CategoryName = c.CategoryName, Name = p.Name });
Linq - Execute the query
Depending on the underlying implementation of the data source, executing the query might mean completely different things.
E.g. when querying data stored in memory (e.g. List<Product>
) it will use C# methods to create the result. When talking to a database using an IQueryable object, an SQL query will be generated instead and executed on the database (that’s called Linq to SQL btw.).
Linq - Executing the query
You can execute a query by enumerating the elements of the IEnumerable interface, for example using a foreach loop, or by calling ToList:
// Execute a query using a loopforeach (var product in query) { Console.WriteLine( $"{product.Name} {product.CategoryName}");}
// Execute a query using ToListvar result = query.ToList();
Deferred execution / Lazy evaluation
When defining a Linq query, no operation on the underlying data source will be triggered. We only specify what will happen, once we actually execute the query.
Only by enumerating the query, calling ToList / ToArray, or similar we actually perform the operations specified in the query.
Selection and Projection
When using a where operation, what we do is actually a selection. We choose which of the available items should be included in the result.
When using a select operation, what we do is actually a projection. We project one representation of data on another representation. For example instead of querying projects, we only query the name for each project.
Linq - Select ( Projection)
Linq - Select (Projection) (2)
Linq - Select (Projection) (3)
For projection you can also use anonymous classes.
IQueryable<Customer> customers = db.GetTable<Customers>();
var namePhoneQuery = from cust in customers where cust.City == "London" select new { Name = cust.Name, Phone = cust.Phone };
foreach (var item in namePhoneQuery) { Console.WriteLine($"{item.Name}: {item.Phone}");}
Linq - Where (Selection)
We can use the where
clause in Linq to restrict the data we want to return. This is called selection, or filtering.
Linq - Where (Selection) (2)
IList<Student> studentList = new List<Student>() { new Student { StudentId = 1, StudentName = "John", Age = 13 }, new Student { StudentId = 2, StudentName = "Detlef", Age = 21 }};
Linq - Where (Selection) (3)
// Query all students whose names have at least 5 charactersvar result = from s in studentList where s.Length >= 5 select s;
// Find all students aged between 13 and 19var result = from s in studentList where s.Age <= 19 && s.Age >= 13 select s;
Linq - Where with delegate
Func<Student, bool> isTeenager = (Student s) => s.Age <= 19 && s.Age >= 13;
// Find all students aged between 13 and 19var result = from s in studentList where isTeenager(s) select s;
Linq - From
Use from to define the data source of your query.
var result = from s in studentList select s.StudentName.Substring(0, 1);
Linq - OrderBy
Use OrderBy to sort the elements of your query.
string[] words = [ "the", "quick", "brows", "fox", "jumps"];
IEnumerable<string> query = from word in words orderby word.Length select word;
Linq - OrderBy (2)
You can also define the order (ascending or descending). If not specified, ascending sorting will be used.
string[] words = [ "the", "quick", "brows", "fox", "jumps"];
IEnumerable<string> query = from word in words orderby word.Length descending select word;
Data Aggregation
You can use different forms of join operations with Linq.
from ... in <outerSequence>join ... in <innerSequence>on <outerKey> equals <innerKey>select ...
Linq - Inner Join
// Step 1: Define a data sourceList<Product> products = new List<Product> { new Product("Apple", 1), new Product("Coke", 0), new Product("Tea", 0)};
List<Category> categories = new List<Category> { new Category(0, "Beverage"), new Category(1, "Fruit"), new Category(2, "Vegetable")};
Linq - Inner Join (2)
// For all products, find the product name and category namevar result = from p in products join c in categories on p.CategoryId equals c.Id select new { CategoryName = c.CategoryName, Name = p.Name }
Linq - Inner Join - Object Association
// Define a data sourceclass Person { public string FirstName { get; set; } public string LastName { get; set; }}
class Pet { public string Name { get; set; } public Person Owner { get; set; }}
Linq - Inner Join - Object Association (2)
// For all pets, find the pet's name and the owner's namevar query = from p in people join pet in pets on p equals pet.Owner select new { OwnerName = p.FirstName, PetName = pet.Name }
Linq - Inner Join - Composite Key
// Define a data sourceList<Employee> employees = new List<Employee> { new Employee( "Terry", "Adams", 522459 )};
List<Student> students = new List<Student> { new Student( "Vernette", "Price", 9562 )};
Linq - Inner Join - Composite Key (2)
// Define a queryvar result = from e in employees join s in students on new { e.FirstName, e.LastName } equals new { s.FirstName, s.LastName } select e.FirstName + " " + e.LastName;
Linq - Inner Join - Multiple Join
// Define a data sourceclass Person { public string FirstName { get; set; } public string LastName { get; set; }}
class Pet { public string Name { get; set; } public Person Owner { get; set; }}
class Cat : Pet { }class Dog : Pet { }
Linq - Inner Join - Multiple Join (2)
// Define a queryvar query = from p in people join c in cats on p equals c.Owner join d in dogs on new { Owner = p, Letter = c.Name.Substr(0, 1) } equals new { d.Owner, Letter = d.Name.Substr(0, 1) } select new { CatName = c.Name, DogName = d.Name }
Linq - Partitioning
Partitioning operators are used to partition the result of a query.
Linq - Partitioning (2)
- Operators for Partitioning:
- Skip
- SkipWhile
- Take
- TakeWhile
Linq - Partitioning (3)
Skip ignores the first n elements.
IEnumerable<string> query = select ... ;
query.Skip(5);
Linq - Partitioning (4)
SkipWhile ignores elements at the beginning of the data source as long as they fulfil a certain condition.
IEnumerable<string> query = {59, 82, 70, 56, 92};
query.SkipWhile(grade => grade >= 80);
Linq - Partitioning (5)
Take limits the result to a certain amount of elements.
IEnumerable<string> query = {59, 82, 70, 56, 92};query.Take(3);
Linq - Aggregation
Aggregation Operators can be used to find aggregates of multiple data entries.
Linq - Aggregation (2)
- Aggregation Operators:
- Min, Max, Average
- Count, Sum
- Aggregate
Linq - Aggregation (3)
// Compute the sum of the elements of an arrayvar numbers = new int[] { 4, 56, 2, 5, 43, 5 };int sum = (from n in numbers).Sum();
// Compute the sum of the elements of an arrayvar numbers = new int[] { 4, 56, 2, 5, 43, 5 };int sum = numbers.Sum();
Linq - Aggregation (4)
// Compute the sum of the elements of an arrayvar numbers = new int[] { 4, 5, 3, 9 }j
int sum = numbers.Aggregate( (result, item) => result + item);
Linq - Aggregation (5)
Linq - Grouping
Use tho group clause to group data.
Linq - Grouping (2)
List<int> numbers = new List<int> { 35, 44, 200, 84, 3987, 4, 199, 446};
// Group the list of numbers into even and odd numbersIEnumerable<IGrouping<int, int>> query = from n in numbers group n by n % 2;
List<int> numbers = new List<int> { 35, 44, 200, 84, 3987, 4, 199, 446};
// Group the list of numbers into even and odd numbersIEnumerable<IGrouping<int, int>> query = numbers.GroupBy(n => n % 2);
Linq - Grouping (3)
foreach (var group in query) { Console.WriteLine( group.Key == 0 ? "Even numbers" : "Odd numbers")
foreach (var i in group) { Console.WriteLine(i); }}
Linq - Grouping (4)
List<Student> students = new List<Student> { new { "Terry", "Adams", 120, GradeLevel.SecondYear }, new { "Fadi", "Fakhouri", 120, GradeLevel.ThirdYear }};
Linq - Grouping (5)
Group elements by a single property.
var lastNames = from s in students group s by s.LastName into newGroup // grouped data orderby newGroup.Key select newGroup;
var lastNames = students .GroupBy(s => s.LastName) .OrderBy(g => g.Key);
Linq - Grouping (6)
Group elements by a derived value.
var groupResults = from s in students group s by s.LastName[0]
var groupResults = students .GroupBy(s => s.LastName[0]);
foreach (var studentGroup in groupResults) { foreach (var s in studentGroup) { Console.WriteLine( $"{s.LastName} {s.FirstName}" ); }}
Linq - Grouping (7)
var query = from s in students group s by new { FirstLetter = s.LastName[0], Score = s.ExamScore > 85 } into studentGroup // grouped data orderby studentGroup.Key.FirstLetter select studentGroup;
var query = students .GroupBy(s => new { FirstLetter = s.LastName[0], Score = s.ExamScore > 85 }) .OrderBy(g => g.Key.FirstLetter);
Linq - Group and Filter
var query = from s in students group s by new { FirstLetter = s.LastName[0], Score = s.ExamScore > 85 } into studentGroup where studentGroup.Count() > 3 select studentGroup
var query = students .GroupBy(s => new { FirstLetter = s.LastName[0], Score = s.ExamScore > 85 }) .Where(g => g.Count() > 3) .OrderBy(g => g.Key.FirstLetter);
Linq - Min, Max, Average
var players = new List<Player> { new Player(name: "Alex", team: "A", score: 10), new Player(name: "Anna", team: "A", score: 20), new Player(name: "Luke", team: "L", score: 60), new Player(name: "Lucy", team: "L", score: 40)};
Linq - Min, Max, Average (2)
var scores = from p in players group p by p.Team into playerGroup select new { Team = playerGroup.Key, TotalScore = playerGroup.Sum( x => x.Score ) };
var scores = players .GroupBy(p => p.Team) .Select(g => new { Team = playerGroup.Key, TotalScore = g.Sum(x => x.Score) });
Linq - Set Operations
To further specify the data source of a query you can use set operations such as:
- Distinct
- Except
- Intersect
- Union
Linq - Distinct
The Distinct method removes duplicates from the data source.
Linq - Distinct (2)
string[] planets = { "Mercury", "Venus", "Venus", "Earth", "Mars", "Earth"};
var query = from planet in planets.Distinct() select planet;
var query = planets.Distinct();
Linq - Except
Linq - Except (2)
string[] planets1 = { "Mercury", "Venus", "Earth", "Jupiter"};
string[] planets2 = { "Mercury", "Earth", "Mars", "Jupiter"};
var query = from planet in planets1.Except(planets2) select planet;
var query = planets1.Except(planets2);
Linq - Intersect
Intersect finds elements that appear in both data sources.
Linq - Intersect (2)
string[] planets1 = { "Mercury", "Venus", "Earth", "Jupiter"};
string[] planets2 = { "Mercury", "Earth", "Mars", "Jupiter"};
var query = from planet in planets1.Intersect(planets2) select planet;
vat query = planets1.Intersect(planets2);
Linq - Union
Union combines two data sources to one, including all elements from both collections.
Union will also remove duplicates as long as the items can be compared using the overriden Equals
method or an comparer object that implements the IEquatable<T>
interface.
Linq - Union (2)
string[] planets1 = { "Mercury", "Venus", "Earth", "Jupiter"};
string[] planets2 = { "Mercury", "Earth", "Mars", "Jupiter"};
var query = from planet in planets1.Union(planets2) select planet;
var query = planets1.Union(planets2);
Linq - Quantifiers
Quantifiers classify a set of values. The return value of a quantifier is a boolean value.
- Operators:
- All
- Any
- Contains
Linq - All
The All quantifier checks, whether each and everyone of the elements of a data source fulfils a condition.
List<Market> markets = new List<Market> { new Market { Name = "Emiliy", Items = new string[]{"kiwi", "cherry", "banana"} }, new Market { Name = "Kim", Items = new string[]{"melon", "mango", "olive"} }};
Linq - All (2)
// Which market only offers fruits// of which the name is at least 5 characters longvar query = from m in markets where m.Items.All(item => item.Length >= 5) select m.Name;
var query = markets .Where(m => m.Items.All(item => item.Length >= 5));
Linq - Any
The Any quantifier checks, whether at least one element fulfils a condition.
// Which market sells at least one kind of fruit// that starts with the letter o.var query = from m in markets where m.Items.Any(item => item.StartsWith("o")) select m.Name;
var query = markets .Where(m => m.Items.Any(item => item.StartsWith("o")));
Linq - Contains
The Contains quantifier checks whether a certain element exists in the data source.
// Which market sells at least one fruit that starts with ovar query = from m in markets where m.Items.Contains("kiwi") select m.Name;
var query f markets .Where(m => m.Items.Contains("kiwi"));
Linq - Subqueries
Subqueries are queries that are embedded into other queries.
var players = new List<Player> { new Player(name: "Alex", team: "A", score: 10), new Player(name: "Anna", team: "A", score: 20), new Player(name: "Luke", team: "L", score: 60), new Player(name: "Lucy", team: "L", score: 40)};
Linq - Subqueries (2)
// Which players got the most points?var query = from p in players where p.Points == (from p1.Points in players).Max() select p;
var query = players .Where(p => p.Points == players.Max(p1 => p1.Points));