Table of Contents
ResourcesBinomial CoefficientsMethod 1: Pascal's Triangle (Dynamic Programming) - Method 2: Factorial Definition (Modular Inverses) - Solution - Binomial CoefficientsDerangementsMethod 1: Principle of Inclusion-ExclusionMethod 2: Dynamic ProgrammingStars and BarsExplanationImplementationExpected ValueExplanationImplementationLinearity of ExpectationExplanationImplementationExpected ProductsProblemsCombinatorics
Authors: Jesse Choe, Aadit Ambadkar, Dustin Miao, Mihnea Brebenel, Peng Bai
How to count.
Table of Contents
ResourcesBinomial CoefficientsMethod 1: Pascal's Triangle (Dynamic Programming) - Method 2: Factorial Definition (Modular Inverses) - Solution - Binomial CoefficientsDerangementsMethod 1: Principle of Inclusion-ExclusionMethod 2: Dynamic ProgrammingStars and BarsExplanationImplementationExpected ValueExplanationImplementationLinearity of ExpectationExplanationImplementationExpected ProductsProblemsIf you've never encountered any combinatorics before, AoPS is a good place to start.
Resources | ||||
---|---|---|---|---|
AoPS | practice problems; set focus to Counting and Probability for module topics | |||
AoPS | good book |
Resources
Resources | ||||
---|---|---|---|---|
CPH | module is based off of this | |||
cp-algo | ||||
HE | teaches fundamental combinatorics with a practice problem at the end | |||
AoPS | teaches basic combinatorics concepts | |||
AoPS | teaches more advanced combinatorics concepts | |||
CF | a good blog about the inclusion-exclusion principle |
If you prefer watching videos instead, here are some options:
Resources | ||||
---|---|---|---|---|
YouTube | playlist by mathemaniac | |||
YouTube | lectures 16-23 | |||
YouTube | Errichto video regarding expected value and sums of subsets |
Binomial Coefficients
Focus Problem – try your best to solve this problem before continuing!
The binomial coefficient (pronounced as " choose " or sometimes written as ) represents the number of ways to choose a subset of elements from a set of elements. For example, , because the set has subsets of elements:
There are two ways to calculate binomial coefficients:
Method 1: Pascal's Triangle (Dynamic Programming) -
Binomial coefficients can be recursively calculated as follows:
The intuition behind this is to fix an element in the set and choose elements from elements if is included in the set or choose elements from elements, otherwise.
The base cases for the recursion are:
because there is always exactly one way to construct an empty subset and a subset that contains all the elements.
This recursive formula is commonly known as Pascal's Triangle.
A naive implementation of this would use a recursive formula, like below:
C++
/** @return nCk mod p using naive recursion */int binomial(int n, int k, int p) {if (k == 0 || k == n) { return 1; }return (binomial(n - 1, k - 1, p) + binomial(n - 1, k, p)) % p;}
Java
/** @return nCk mod p using naive recursion */public static int binomial(int n, int k, int p) {if (k == 0 || k == n) { return 1; }return (binomial(n - 1, k - 1, p) + binomial(n - 1, k, p)) % p;}
Python
def binomial(n: int, k: int, p: int) -> int:""":return: nCk mod p using naive recursion"""if k == 0 or k == n:return 1return (binomial(n - 1, k - 1, p) + binomial(n - 1, k, p)) % p
Additionally, we can optimize this from to using dynamic programming (DP) by caching the values of smaller binomials to prevent recalculating the same values over and over again. The code below shows a bottom-up implementation of this.
C++
/** @return nCk mod p using dynamic programming */int binomial(int n, int k, int p) {// dp[i][j] stores iCjvector<vector<int>> dp(n + 1, vector<int>(k + 1, 0));// base cases described abovefor (int i = 0; i <= n; i++) {/** i choose 0 is always 1 since there is exactly one way* to choose 0 elements from a set of i elements
Java
/** @return nCk mod p using dynamic programming */public static int binomial(int n, int k, int p) {// dp[i][j] stores iCjint[][] dp = new int[n + 1][k + 1];// base cases described abovefor (int i = 0; i <= n; i++) {/** i choose 0 is always 1 since there is exactly one way* to choose 0 elements from a set of i elements
Python
def binomial(n: int, k, p):""":return: nCk mod p using dynamic programming"""# dp[i][j] stores iCjdp = [[0] * (k + 1) for _ in range(n + 1)]# base cases described abovefor i in range(n + 1):"""i choose 0 is always 1 since there is exactly one wayto choose 0 elements from a set of i elements
Method 2: Factorial Definition (Modular Inverses) -
Define as . represents the number of permutations of a set of elements. See this AoPS Article for more details.
Another way to calculate binomial coefficients is as follows:
Recall that also represents the number of ways to choose elements from a set of elements. One strategy to get all such combinations is to go through all possible permutations of the elements, and only pick the first elements out of each permutation. There are ways to do so. However, note the the order of the elements inside and outside the subset does not matter, so the result is divided by and .
Since these binomial coefficients are large, problems typically require us to output the answer modulo a large prime such as . Fortunately, we can use modular inverses to divide by and modulo for any prime . Computing inverse factorials online can be very time-consuming. Instead, we can precompute all factorials in time and inverse factorials in . First, we compute the modular inverse of the largest factorial using binary exponentiation. For the rest, we use the fact that . See the code below for the implementation.
C++
const int MAXN = 1e6;long long fac[MAXN + 1];long long inv[MAXN + 1];/** @return x^n modulo m in O(log p) time. */long long exp(long long x, long long n, long long m) {x %= m; // note: m * m must be less than 2^63 to avoid ll overflowlong long res = 1;while (n > 0) {
Java
import java.util.*;public class BinomialCoefficients {private static final int MAXN = (int)1e6;private static long[] fac = new long[MAXN + 1];private static long[] inv = new long[MAXN + 1];/** @return x^n modulo m in O(log p) time. */private static long exp(long x, long n, long m) {x %= m; // note: m * m must be less than 2^63 to avoid ll overflow
Python
MAXN = 10**6fac = [0] * (MAXN + 1)inv = [0] * (MAXN + 1)def exp(x: int, n: int, m: int) -> int:""":return: x^n modulo m in O(log p) time."""x %= m # note: m * m must be less than 2^63 to avoid ll overflowres = 1
Solution - Binomial Coefficients
The first method for calculating binomial factorials is too slow for this problem since the constraints on and are (recall that the first implementation runs in time complexity). However, we can use the second method to answer each of the queries in constant time by precomputing factorials and their modular inverses.
C++
#include <iostream>using namespace std;using ll = long long;const int MAXN = 1e6;const int MOD = 1e9 + 7;ll fac[MAXN + 1];ll inv[MAXN + 1];
Java
import java.io.*;import java.util.*;public class BinomialCoeffs {private static final int MAXN = (int)1e6;private static final int MOD = (int)1e9 + 7;private static long[] fac = new long[MAXN + 1];private static long[] inv = new long[MAXN + 1];public static void main(String[] args) {
Python
MAXN = 10**6MOD = 10**9 + 7fac = [0] * (MAXN + 1)inv = [0] * (MAXN + 1)Code Snippet: Counting Functions (Click to expand)
Derangements
Focus Problem – try your best to solve this problem before continuing!
The number of derangements of numbers, expressed as , is the number of permutations such that no element appears in its original position. Informally, it is the number of ways hats can be returned to people such that no person recieves their own hat.
Method 1: Principle of Inclusion-Exclusion
Suppose we had events , where event corresponds to person recieving their own hat. We would like to calculate .
We subtract from the number of ways for each event to occur; that is, consider the quantity . This undercounts, as we are subtracting cases where more than one event occurs too many times. Specifically, for a permutation where at least two events occur, we undercount by one. Thus, add back the number of ways for two events to occur. We can continue this process for every size of subsets of indices. The expression is now of the form:
For a set size of , the number of permutations with at least indicies can be computed by choosing a set of size that are fixed, and permuting the other indices. In mathematical terms:
Thus, the problem now becomes computing
which can be done in linear time.
C++
#include <atcoder/modint>#include <bits/stdc++.h>using namespace std;using mint = atcoder::modint;int main() {int n, m;cin >> n >> m;mint::set_mod(m);
Java
import java.util.Scanner;public class Main {public static void main(String[] args) {Scanner scanner = new Scanner(System.in);int n = scanner.nextInt();int m = scanner.nextInt();long c = 1;for (int i = 1; i <= n; i++) {
Python
n, m = map(int, input().split())c = 1for i in range(1, n + 1):c = (c * i) + (-1 if i % 2 == 1 else 1)c %= mc += mc %= mprint(c, end=" ")print()
Method 2: Dynamic Programming
Suppose person 1 recieved person 's hat. There are two cases:
- If person recieves person 1's hat, then the problem is reduced to a subproblem of size . There are possibilities for in this case, so we add to the current answer .
- If person does not recieve person 1's hat, then we can reassign person 1's hat to be person 's hat (if they recieved person 1's hat, then this would become first case). Thus, this becomes a subproblems with size , are there ways to choose .
Thus, we have
which can be computed in linear time with DP. The base cases are that and .
C++
#include <atcoder/modint>#include <bits/stdc++.h>using namespace std;using mint = atcoder::modint;int main() {int n, m;cin >> n >> m;mint::set_mod(m);
Java
import java.util.Scanner;public class Main {public static void main(String[] args) {Scanner scanner = new Scanner(System.in);int n = scanner.nextInt();int m = scanner.nextInt();int a = 1;int b = 0;
Python
n, m = map(int, input().split())a, b = 1, 0print(0, end=" ")for i in range(2, n + 1):c = (i - 1) * (a + b) % mprint(c, end=" ")a, b = b, cprint()
Stars and Bars
Resources | ||||
---|---|---|---|---|
cp-algo | ||||
Medium | Well documented article. |
Stars and Bars is a useful method in combinatorics that involves grouping indistinguishable objects into distinguishable boxes. The number of ways to put indistinguishable objects into distinguishable boxes is:
The second binomial coefficient from above can be derived from the property of binomial coefficients: .
Let's take a look at a particular example for and that has possibilities. As the name implies, the visualization is usually done with stars separated into groups by bars:
As you've probably noticed, there can be empty boxes - when we put all the stars in the first or second box. There may be cases in which the all the boxes should be non-empty. In that case, the number of ways to put indistinguishable objects into distinguishable non-empty boxes is:
Focus Problem – try your best to solve this problem before continuing!
Explanation
For this problem we should think the other way around: let's say that the colors from which we choose are in fact boxes and instead of choosing marbles we put them in the respective boxes. The problem has the restriction that we should pick at least one marble of all kinds, which means in our new perspective that all the boxes should be non-empty. Thus, the answer is obtained by the second formula:
Implementation
Time Complexity:
C++
#include <iostream>/** @return n choose k, computed naively since the problem has no overflow */long long comb(int n, int k) {if (k > n - k) { k = n - k; }long long ret = 1;for (int i = 0; i < k; i++) {// this is done instead of *= for divisibility issuesret = ret * (n - i) / (i + 1);}
Python
from math import combans = []for _ in range(int(input())):marble_num, color_num = [int(i) for i in input().split()]ans.append(str(comb(marble_num - 1, color_num - 1)))print("\n".join(ans))
Expected Value
Resources | ||||
---|---|---|---|---|
Brilliant | lots of pure math examples of expected value + explanations | |||
Brilliant | lots of pure math examples of linearity of expectation + explanations | |||
CF | a good blog about the expected value |
An expected value is the theoretical mean of a probability distribution. A random variable is used to represent a possible probability distribution. Let be a random variable and be the probability that the result of the random variable is . Then, the expected value of , denoted as , is
For example, let be the probability distribution of a fair 6-sided die. is for . Using the formula, we get .
Focus Problem – try your best to solve this problem before continuing!
Explanation
Let represent the probability distribution of the maximum number of candies a child gets. To get , we need for . is the probability that each child gets at most candies. To get , we must subtract out the probability that each child gets strictly less than candies, which is .
Therefore, , allowing us to calculate .
Implementation
Time Complexity: using naive exponentiation or using binary exponentiation.
C++
#include <cmath>#include <iomanip>#include <iostream>using std::cout;using std::endl;int main() {int n;int k;
Python
n, k = [int(i) for i in input().split()]expected_max = 0for c in range(1, k + 1):expected_max += c * ((c / k) ** n - ((c - 1) / k) ** n)print(f"{expected_max:.6f}")
Linearity of Expectation
Linearity of expectation states that no matter if and are independent of each other. For example, if on a certain day Alice goes to the gym with a probability of and her husband Bob goes to the gym with a probability of , the expected number of visits to the gym on a certain day amongst the couple is . This works even though the decisions of one person may affect the other.
This can be generalized to a sequence of random variables and arbitrary constants :
Focus Problem – try your best to solve this problem before continuing!
Explanation
While this may seem impossible at first glance given the amount of different sequences of operations, linearity of expectation allows us to break down the expected value into the sum of smaller parts.
We can break this down by decomposing the initial random variable into the sums of various indicator random variables. Let be a variable that indicates whether node was explicitly marked for removal. Since it's basically a boolean, it can only take on either or , which also means that the probability that it's is equal to its expected value.
Now, let's consider how an operation affects node .
- Either it chooses a node that can't reach , and nothing happens as far as is concerned.
- It chooses itself, making the indicator variable .
- It chooses another node that can reach , making the indicator variable .
Since all nodes have an equal chance of being chosen, the chance that is , where is the number of nodes that can reach including itself.
We put the expected values of all the indicator variables back together, giving us our final answer.
Implementation
Time Complexity: using a naive DFS or BFS.
C++
#include <iomanip>#include <iostream>#include <vector>using std::cout;using std::endl;using std::vector;int main() {int n;
Python
n = int(input())adj = [[] for _ in range(n)]for i in range(n):for v, c in enumerate(input()):if c == "1":adj[i].append(v)# reach_from[i] = the # of nodes that have a path to ireach_from = [0 for _ in range(n)]
Expected Products
Linearity of Expectation deals with , but what about ?
if and are independent from each other. We can reconsider the example of a fair 6-sided die to show that . We know that , so . On the other hand,
Problems
Status | Source | Problem Name | Difficulty | Tags | |
---|---|---|---|---|---|
CSES | Easy | Show TagsCombinatorics | |||
CSES | Easy | Show TagsCombinatorics | |||
CF | Easy | Show TagsCombinatorics | |||
CF | Easy | Show TagsBinary Search, Combinatorics | |||
Bronze | Easy | Show TagsCombinatorics | |||
Bubble Cup | Normal | Show TagsCombinatorics | |||
AC | Normal | Show TagsBitwise, Combinatorics | |||
CF | Normal | Show TagsCombinatorics | |||
AC | Normal | Show TagsCombinatorics | |||
Gold | Normal | Show TagsCombinatorics | |||
Gold | Normal | Show TagsBitset, PIE | |||
CF | Normal | Show TagsCombinatorics, DP | |||
Gold | Normal | Show TagsCombinatorics, Prefix Sums | |||
CF | Hard | Show TagsCombinatorics, DP | |||
Gold | Hard | Show TagsBinary Search, Combinatorics, Math, Probability |
Module Progress:
Join the USACO Forum!
Stuck on a problem, or don't understand a module? Join the USACO Forum and get help from other competitive programmers!