Anatomy of Random Effects

CASE 1 OF 10

IID, one latent value per observation

Spec: {'id': 'row_id', 'model': 'iid'} with 10 distinct row_id values

1 hyperparamlog τ(one precision for all 10 latent values)

y_i = μ + u_i, u_i iid~ N(0, τ⁻¹), i = 1..10

One latent value per observation. Classic use: absorbing overdispersion. n_levels = n_obs = 10.

i=1	y₁	=	μ	+	u₁
i=2	y₂	=	μ	+	u₂
i=3	y₃	=	μ	+	u₃
i=4	y₄	=	μ	+	u₄
i=5	y₅	=	μ	+	u₅
i=6	y₆	=	μ	+	u₆
i=7	y₇	=	μ	+	u₇
i=8	y₈	=	μ	+	u₈
i=9	y₉	=	μ	+	u₉
i=10	y₁₀	=	μ	+	u₁₀

CASE 2 OF 10

IID, grouped (the hierarchical case)

Spec: {'id': 'school_id', 'model': 'iid'} with 4 schools, 10 students

1 hyperparamlog τ(one precision for the 4 school effects)

y_i = μ + u_g(i), u_k iid~ N(0, τ⁻¹), k = 1..4

Many observations share the same latent value (students in schools). n_levels (4) is smaller than n_obs (10), so the same u appears on several rows.

i=1	y₁	=	μ	+	u₁
i=2	y₂	=	μ	+	u₁
i=3	y₃	=	μ	+	u₁
i=4	y₄	=	μ	+	u₂
i=5	y₅	=	μ	+	u₂
i=6	y₆	=	μ	+	u₂
i=7	y₇	=	μ	+	u₃
i=8	y₈	=	μ	+	u₃
i=9	y₉	=	μ	+	u₄
i=10	y₁₀	=	μ	+	u₄

CASE 3 OF 10

IID, replicated (nrep independent copies of the same iid block)

Spec: {'id': 'level', 'model': 'iid', 'replicate': rep, 'nrep': 2}

1 hyperparamlog τ(shared across both replicates)

y_i = μ + u_{ℓ(i), r(i)}, u_k,r iid~ N(0, τ⁻¹), k = 1..5, r = 1..2

The whole iid block is repeated nrep times, independently. Each observation carries two integer indices: its level (1..5) and which replicate it belongs to (1..2). Together they pick out one of 5 × 2 = 10 latent values. The two replicate blocks share one precision τ (estimated jointly), but the realized u-draws across reps are uncorrelated: same level in different reps gives a different u.

The two index vectors (each length 10)
i
                  12345
                  678910
                
level
                  12345
                  12345
                
rep
                  11111
                  22222
                

                Both vectors have length 10. level has 5 unique values (each appearing twice); rep has 2 unique values. Together they index 5 × 2 = 10 independent latent values.
              

rep = 1
i=1	y₁	=	μ	+	u_1,1
i=2	y₂	=	μ	+	u_2,1
i=3	y₃	=	μ	+	u_3,1
i=4	y₄	=	μ	+	u_4,1
i=5	y₅	=	μ	+	u_5,1
rep = 2
i=6	y₆	=	μ	+	u_1,2
i=7	y₇	=	μ	+	u_2,2
i=8	y₈	=	μ	+	u_3,2
i=9	y₉	=	μ	+	u_4,2
i=10	y₁₀	=	μ	+	u_5,2

Why colors split into two families? Cool colors (indigo → teal) mark rep = 1; warm colors (pink → amber) mark rep = 2. Observations 1 and 6 both have level = 1 but their badges have different colors, because they pick up different latent values (u_1,1 vs u_1,2).

CASE 4 OF 10

IID + `control.group` (correlation across a second axis)

Spec: {'id': 'region', 'model': 'iid', 'group': week, 'ngroup': 2, 'control.group': {'model': 'ar1'}}

2 hyperparamslog τ + AR1 ρ(precision shared across regions; second hyperparam is the across-group AR1 correlation)

y_i = μ + u_{ℓ(i), g(i)}
within group g: u_·,g iid~ N(0, τ⁻¹) | across g: corr(u_k,1, u_k,2) = ρ (AR1)

Same 2D layout as case 3 (5 regions × 2 weeks = 10 latent values), but now the second axis carries correlation, not independent copies. Within each week the 5 regions are still iid. Between the two weeks, the matching latent values are tied together: u_k,1 and u_k,2 share a one-step AR1 correlation ρ. So obs 1 (region 1, week 1) and obs 6 (region 1, week 2) are not the same value, but they are correlated: knowing one tells you something about the other.

The two index vectors (each length 10)
i
                  12345
                  678910
                
region
                  12345
                  12345
                
week
                  11111
                  22222
                

                Same shape as case 3 (10 obs, 5 regions, 2 weeks, 5 × 2 = 10 latent values). The difference is the control.group dict: setting 'model': 'ar1' turns the across-group axis from iid into AR1.
              

g = 1 (week 1)
i=1	y₁	=	μ	+	u_1,1
i=2	y₂	=	μ	+	u_2,1
i=3	y₃	=	μ	+	u_3,1
i=4	y₄	=	μ	+	u_4,1
i=5	y₅	=	μ	+	u_5,1
g = 2 (week 2), AR1-correlated with g = 1
i=6	y₆	=	μ	+	u_1,2
i=7	y₇	=	μ	+	u_2,2
i=8	y₈	=	μ	+	u_3,2
i=9	y₉	=	μ	+	u_4,2
i=10	y₁₀	=	μ	+	u_5,2

Why colors come in light/dark pairs? Each region gets one hue. The saturated shade marks week 1, the lighter shade marks week 2. Same hue says "same region, AR1-correlated"; different shade says "different week so different draw." Contrast with case 3, where rep 1 and rep 2 use unrelated cool / warm families because they are independent.

case 3 vs case 4: identical data layout, identical level and group vectors, identical 5 × 2 = 10 latent values. The only difference is the control.group dict (absent in case 3, set to {'model': 'ar1'} here). That single key swaps independence for correlation along the second axis.

CASE 5 OF 10

RW1 (ordered, neighbors are correlated)

Spec: {'id': 'time', 'model': 'rw1'} with 10 ordered time points

1 hyperparamlog τ(one precision for the RW1 increments)

y_i = μ + u_i, u_i − u_i−1 iid~ N(0, τ⁻¹)

One latent value per ordered position, like IID, but neighbors are tied together by an increment penalty. Colors shade smoothly to remind you of the neighbor coupling.

i=1	y₁	=	μ	+	u₁
i=2	y₂	=	μ	+	u₂
i=3	y₃	=	μ	+	u₃
i=4	y₄	=	μ	+	u₄
i=5	y₅	=	μ	+	u₅
i=6	y₆	=	μ	+	u₆
i=7	y₇	=	μ	+	u₇
i=8	y₈	=	μ	+	u₈
i=9	y₉	=	μ	+	u₉
i=10	y₁₀	=	μ	+	u₁₀

CASE 6 OF 10

The `group` key on its own (what does it actually do?)

Spec: {'id': 'region', 'model': 'iid', 'group': week, 'ngroup': 2} (no control.group set)

1 hyperparamlog τ(default control.group is iid, so no second hyperparam)

y_i = μ + u_{ℓ(i), g(i)}
within group g: u_·,g iid~ N(0, τ⁻¹) | across g: independent (default control.group is iid)

group introduces a second indexing axis on top of id. By itself, with no control.group dict, the across-group axis defaults to iid: all 5 × 2 = 10 latent values are independent draws. Mathematically, this case is identical to case 3 (replicate). The point of group is not what it does on its own; it is what it lets you do next: add a control.group dict to make the across-group axis correlated (case 4).

The two index vectors (each length 10)
i
                  12345
                  678910
                
region
                  12345
                  12345
                
week
                  11111
                  22222
                

                Same shape as case 3 and case 4 (10 obs, 5 regions, 2 weeks, 5 × 2 = 10 latent values). What changes between the three cases is only the spec dict.
              

g = 1 (week 1)
i=1	y₁	=	μ	+	u_1,1
i=2	y₂	=	μ	+	u_2,1
i=3	y₃	=	μ	+	u_3,1
i=4	y₄	=	μ	+	u_4,1
i=5	y₅	=	μ	+	u_5,1
g = 2 (week 2), independent of g = 1 (default iid)
i=6	y₆	=	μ	+	u_1,2
i=7	y₇	=	μ	+	u_2,2
i=8	y₈	=	μ	+	u_3,2
i=9	y₉	=	μ	+	u_4,2
i=10	y₁₀	=	μ	+	u_5,2

Why colors split into two families (cool / warm)? Same scheme as case 3: cool for g = 1, warm for g = 2. Independent groups get unrelated hue families. Compare with case 4, where the same hue (saturated vs light) marks the AR1 tie between matched regions.

The three sister cases at a glance:

Case 3 (replicate + nrep): two independent copies. Posterior output is indexed by (region, rep).
Case 6 (group + ngroup, no control.group): mathematically the same as case 3. The keyword choice mostly affects output labeling and what extensions you can add later.
Case 4 (case 6 + 'control.group': {'model': 'ar1'}): adds AR1 correlation across groups. This is what group exists for.

CASE 7 OF 10

RW1 cyclic (neighbors wrap around: position n ties back to position 1)

Spec: {'id': 'hour', 'model': 'rw1', 'cyclic': True}

1 hyperparamlog τ(same as RW1; cyclic adds an edge to the graph, not a hyperparameter)

y_i = μ + u_i
u_i − u_i−1 iid~ N(0, τ⁻¹) for i = 2..n | plus u₁ − u_n iid~ N(0, τ⁻¹)

Same setup as case 5 (linear RW1): one latent value per ordered position, smooth-with-neighbors prior. The only difference is the graph topology: setting cyclic: True adds one extra increment u₁ − u_n to the prior, so the start and end of the index are tied together. Use it when the index wraps naturally (hour-of-day, day-of-week, month-of-year, any angular variable).

Neighbor graph

Position 10 wraps to position 1 (pink dashed edge).

i=1	y₁	=	μ	+	u₁
i=2	y₂	=	μ	+	u₂
i=3	y₃	=	μ	+	u₃
i=4	y₄	=	μ	+	u₄
i=5	y₅	=	μ	+	u₅
i=6	y₆	=	μ	+	u₆
i=7	y₇	=	μ	+	u₇
i=8	y₈	=	μ	+	u₈
i=9	y₉	=	μ	+	u₉
i=10	y₁₀	=	μ	+	u₁₀

Precision matrix Q = τR: linear vs cyclic

The structure matrix R for cyclic RW1 is a circulant tridiagonal: 2 on every diagonal entry, −1 on every nearest-neighbor off-diagonal, plus −1 in the two corners for the wrap edge between positions 1 and n. Compared with linear RW1, the only differences are the four highlighted cells: the boundary diagonals jump from 1 to 2 (positions 1 and n now have two neighbors each), and the two corners pick up −1 entries (the wrap edge).

Linear RW1 (cyclic = False): pure tridiagonal, rank n−1

Diagonal: 1, 2, 2, ..., 2, 2, 1. Off-diagonal: −1. Corners: 0.

Cyclic RW1: circulant tridiagonal, rank still n−1

Diagonal: 2, 2, ..., 2. Off-diagonal: −1. Corners: −1 (dashed, wrap edge).

Summary of changes (4 cells out of 100):

R[1,1] and R[n,n]: 1 → 2 (endpoints gain a neighbor)
R[1,n] and R[n,1]: 0 → −1 (the wrap edge)
Every other cell is unchanged.

Does the wrap edge remove the need for a constraint? No. The null space of R is still the constant vector (1, 1, ..., 1): adding the same number to every u_i leaves every increment (including the wrap one) unchanged. Rank of R stays at n − 1. Cyclic RW1 needs a sum-to-zero constraint just like linear RW1, and constr=True is on by default in pyINLA. What the wrap edge does remove is the freedom for the start and end of the path to drift apart; what it does not remove is the freedom to shift the whole curve up or down.

Why the colors form a wheel? The 10 badges trace the hue circle in steps of 36°. Position 10 (purple-red) sits visually next to position 1 (red) just like its prior neighbor relationship. Compare case 5 where colors run from teal to violet along a line: in linear RW1 position 10 has no special tie to position 1.

Linear (case 5) vs cyclic (case 7):

Both have 1 hyperparameter (log τ), both have rank n − 1 (one null direction: the overall level), both default to constr=True.
Linear RW1 has n−1 = 9 neighbor pairs; cyclic RW1 has n = 10 (one extra wrap edge).
Cyclic samples are tied at the ends; linear samples are free to drift.

CASE 8 OF 10

IID with `weights` (per-observation design scaling)

Spec: {'id': 'group_id', 'model': 'iid', 'weights': w} (w is a 1-D numpy array of length n_obs)

1 hyperparamlog τ(weights are fixed data, not parameters: they do not add a hyperparameter)

y_i = μ + w_i · u_g(i)
u_k iid~ N(0, τ⁻¹), k = 1..3 | w_i are fixed inputs (not estimated)

weights scales the random-effect contribution per observation: w_i multiplies u_g(i) in the linear predictor. The latent vector is the same as case 2 (one u per group). What changes is the design matrix: instead of putting a 1 wherever observation i picks up group g, it puts w_i. So obs sharing a group keep the same colour (same u), but the badge label carries the weight factor.

This is not a likelihood weight. The values w_i here do not change how much each y contributes to the log-likelihood; they multiply the random effect on the linear-predictor side. If you want likelihood weights (importance / inverse-probability weighting), set the family-level weights argument instead.

Index + weight vectors (each length 10)
i12345678910
group_id1112223333
weights1.02.00.51.51.02.01.00.51.52.0

                3 unique groups (3, 3, 4 observations each), so the iid block still has just 3 latent values u1, u2, u3. The weights are not estimated: they enter the design matrix directly.
              

i=1	y₁	=	μ	+	1.0 · u₁
i=2	y₂	=	μ	+	2.0 · u₁
i=3	y₃	=	μ	+	0.5 · u₁
i=4	y₄	=	μ	+	1.5 · u₂
i=5	y₅	=	μ	+	1.0 · u₂
i=6	y₆	=	μ	+	2.0 · u₂
i=7	y₇	=	μ	+	1.0 · u₃
i=8	y₈	=	μ	+	0.5 · u₃
i=9	y₉	=	μ	+	1.5 · u₃
i=10	y₁₀	=	μ	+	2.0 · u₃

Reading the badges. Colour = group (indigo / green / amber). The "w · u_g" label inside the badge is the literal contribution to that row's η_i. Rows 1, 2, 3 all share u₁ but multiply it by 1.0, 2.0, and 0.5 respectively. Same u, different pull.

Three practical uses for weights:

Exposure-style scaling on an RE: a group's effect is multiplied per row by population, time at risk, or area.
Linear slope inside an f() block: set id = 1 (one level), weights = x_i. Then u₁ plays the role of a slope (η_i += x_i · u₁).
Custom design contrasts: fractional or signed shares of a group's u (positive and negative weights are allowed).

CASE 9 OF 10

IID with `values` (reserve slots for unobserved levels)

Spec: {'id': 'school_id', 'model': 'iid', 'values': [1, 2, 3, 4, 5]} (10 students across 4 schools; school 3 has no data yet)

1 hyperparamlog τ(values is a fixed list, not a parameter; declares 5 latent slots regardless of what is observed)

y_i = μ + u_{school_id(i)}
u_k iid~ N(0, τ⁻¹), k = 1..5 | level set fixed by values, not inferred from the data

values declares the complete set of allowed levels for the id column. Without it, pyINLA infers the level set from sort(unique(id)) in the data, so the latent vector has one entry per observed level. With values, the latent vector has one entry per declared level, even those that never appear in the data. Those extra slots are governed purely by the prior; there are no y_i rows that would update them.

What the data has, vs what values declares
i
                  12345678910
                
school_id
                  1112244555
                
                values declares: [1, 2, 3, 4, 5]
              
                The data has 4 distinct schools (1, 2, 4, 5). values declares 5 levels. School 3 is declared but no observation refers to it.

i=1	y₁	=	μ	+	u₁	school 1
i=2	y₂	=	μ	+	u₁	school 1
i=3	y₃	=	μ	+	u₁	school 1
i=4	y₄	=	μ	+	u₂	school 2
i=5	y₅	=	μ	+	u₂	school 2
i=6	y₆	=	μ	+	u₄	school 4
i=7	y₇	=	μ	+	u₄	school 4
i=8	y₈	=	μ	+	u₅	school 5
i=9	y₉	=	μ	+	u₅	school 5
i=10	y₁₀	=	μ	+	u₅	school 5

The full latent vector (length 5, set by values)

u₁school 1data-informed

u₂school 2data-informed

u₃no dataprior only

u₄school 4data-informed

u₅school 5data-informed

Why reserve slots for unobserved levels?

Prediction at unseen groups: you want a posterior for school 3 even though you have no rows for it yet. With values, the posterior is just the prior u₃ ∼ N(0, τ⁻¹); τ is still informed by the observed schools.
Stable indexing across runs: same values list across fits guarantees the same k means the same school in every posterior table.
Document the level set in the spec: makes the model self-describing instead of depending on whatever happens to be in df['school_id'] today.

CASE 10 OF 10

Seasonal (period m: windowed sums of length m are iid)

Spec: {'id': 'time', 'model': 'seasonal', 'season.length': 5} (10 ordered time points = 2 full cycles)

1 hyperparamlog τ(precision of each windowed sum; season.length is fixed metadata, not a parameter)

y_i = μ + u_i
u_i + u_i+1 + … + u_i+m−1 iid~ N(0, τ⁻¹), i = 1, …, n − m + 1

The seasonal model puts the prior on sliding sums of length m, not on neighbor differences. Every consecutive window of m latent values is constrained to look like a small Gaussian. Patterns that repeat with period m (so each cycle of length m roughly averages to zero) sit comfortably under this prior; non-periodic drift accumulates inside the windows and is penalized. The model is intrinsic: it has rank deficiency m − 1, identifiable once you have an intercept or other fixed effects.

The single index vector (length 10), period m = 5
i
                  12345
                  678910
                
time
                  12345
                  678910
                
position
                  12345
                  12345
                

                10 ordered observations span 2 full cycles of length 5. Position 1 of cycle 1 (i=1) and position 1 of cycle 2 (i=6) share a hue but are not forced equal: the seasonal model constrains sums, not individual matches.
              

cycle 1 (i = 1..5)
i=1	y₁	=	μ	+	u₁	pos 1, cycle 1
i=2	y₂	=	μ	+	u₂	pos 2, cycle 1
i=3	y₃	=	μ	+	u₃	pos 3, cycle 1
i=4	y₄	=	μ	+	u₄	pos 4, cycle 1
i=5	y₅	=	μ	+	u₅	pos 5, cycle 1
cycle 2 (i = 6..10), same positions, not forced equal
i=6	y₆	=	μ	+	u₆	pos 1, cycle 2
i=7	y₇	=	μ	+	u₇	pos 2, cycle 2
i=8	y₈	=	μ	+	u₈	pos 3, cycle 2
i=9	y₉	=	μ	+	u₉	pos 4, cycle 2
i=10	y₁₀	=	μ	+	u₁₀	pos 5, cycle 2

The seasonal constraints: every consecutive sum of length m = 5 must look Gaussian

window 1

u₁u₂u₃u₄u₅u₆u₇u₈u₉u₁₀

sum ∼ N(0, τ⁻¹)

window 2

u₁u₂u₃u₄u₅u₆u₇u₈u₉u₁₀

sum ∼ N(0, τ⁻¹)

window 3

u₁u₂u₃u₄u₅u₆u₇u₈u₉u₁₀

sum ∼ N(0, τ⁻¹)

window 4

u₁u₂u₃u₄u₅u₆u₇u₈u₉u₁₀

sum ∼ N(0, τ⁻¹)

window 5

u₁u₂u₃u₄u₅u₆u₇u₈u₉u₁₀

sum ∼ N(0, τ⁻¹)

window 6

u₁u₂u₃u₄u₅u₆u₇u₈u₉u₁₀

sum ∼ N(0, τ⁻¹)

For n = 10 obs and m = 5, there are n − m + 1 = 6 sliding windows. Each one constrains a 5-element sum to be small. This favours patterns whose values inside any cycle of length 5 roughly cancel.

The resulting precision matrix Q = τ R (structure matrix R for n = 10, m = 5)

Stack the six window constraints into the (n − m + 1) × n design matrix D (row i is [0,…,0, 1,1,1,1,1, 0,…,0] with the five 1s at positions i..i+4), then R = D^TD. Entry R_ab counts how many windows contain both positions a and b.

	1	2	3	4	5	6	7	8	9	10
1	1	1	1	1	1	0	0	0	0	0
2	1	2	2	2	2	1	0	0	0	0
3	1	2	3	3	3	2	1	0	0	0
4	1	2	3	4	4	3	2	1	0	0
5	1	2	3	4	5	4	3	2	1	0
6	0	1	2	3	4	5	4	3	2	1
7	0	0	1	2	3	4	4	3	2	1
8	0	0	0	1	2	3	3	3	2	1
9	0	0	0	0	1	2	2	2	2	1
10	0	0	0	0	0	1	1	1	1	1

Bandwidth = 2m − 1 = 9: R_ab = 0 whenever |a − b| ≥ m (no length-m window can contain both).
Interior diagonal = m = 5: positions in the middle of the index sit inside all m overlapping windows; positions near the boundary sit in fewer (a triangular ramp 1, 2, 3, 4, 5 from each corner).
Interior off-diagonals decay linearly: row 5 reads 1, 2, 3, 4, 5, 4, 3, 2, 1, 0, the autocorrelation of the all-ones vector of length m with itself.
Rank deficiency = m − 1 = 4: there are m − 1 directions that all windowed sums miss. (Concretely: shifting every value at slot k ≡ const (mod m) by the same amount while keeping the within-period sum at zero leaves every length-m sum unchanged.) An intercept absorbs one direction; the rest are pinned by data.

Side-by-side: RW1 (case 5) vs RW1 cyclic (case 7) vs Seasonal (this case)

Aspect	RW1 (case 5)	RW1 cyclic (case 7)	Seasonal (case 10)
Prior is on	neighbor increments u_i − u_i−1	same as RW1, plus the wrap pair u_n − u₁	sliding sums of m consecutive values
Built-in period	none, just smoothness	the full length n (one big loop)	configurable m via `season.length`
Forces u_i+m = u_i?	no period at all	only at the wrap (i = n ↔ i = 1)	no, only the sums are constrained
Typical sample	smooth random walk	smooth walk that closes back on itself	oscillation around zero with cycle m
When to reach for it	monotone trend, drift, no recurring shape	angle, day-of-year, anything truly periodic with one full cycle of data	monthly seasonality across years, weekly across weeks, etc.

Mnemonic.

RW1 ties neighbors together: good for smoothness, no period.
RW1 cyclic ties neighbors and closes the loop end-to-start: good when the index itself is a circle (angles, day-of-year).
Seasonal ties every windowed sum of length m: good when the pattern repeats every m steps and you have several cycles of data.

Enter password to continue

Anatomy: how does u attach to y?

IID, one latent value per observation

IID, grouped (the hierarchical case)

IID, replicated (nrep independent copies of the same iid block)

IID + `control.group` (correlation across a second axis)

RW1 (ordered, neighbors are correlated)

The `group` key on its own (what does it actually do?)

RW1 cyclic (neighbors wrap around: position n ties back to position 1)

Precision matrix Q = τR: linear vs cyclic

IID with `weights` (per-observation design scaling)

IID with `values` (reserve slots for unobserved levels)

Seasonal (period m: windowed sums of length m are iid)

i	1	2	3	4	5	6	7	8	9	10
`group_id`	1	1	1	2	2	2	3	3	3	3
`weights`	1.0	2.0	0.5	1.5	1.0	2.0	1.0	0.5	1.5	2.0

	1	2	3	4	5	6	7	8	9	10
1	1	1	1	1	1	0	0	0	0	0
2	1	2	2	2	2	1	0	0	0	0
3	1	2	3	3	3	2	1	0	0	0
4	1	2	3	4	4	3	2	1	0	0
5	1	2	3	4	5	4	3	2	1	0
6	0	1	2	3	4	5	4	3	2	1
7	0	0	1	2	3	4	4	3	2	1
8	0	0	0	1	2	3	3	3	2	1
9	0	0	0	0	1	2	2	2	2	1
10	0	0	0	0	0	1	1	1	1	1

i	1	2	3	4	5	6	7	8	9	10
`level`	1	2	3	4	5	1	2	3	4	5
`rep`	1	1	1	1	1	2	2	2	2	2

i	1	2	3	4	5	6	7	8	9	10
`region`	1	2	3	4	5	1	2	3	4	5
`week`	1	1	1	1	1	2	2	2	2	2

i	1	2	3	4	5	6	7	8	9	10
`region`	1	2	3	4	5	1	2	3	4	5
`week`	1	1	1	1	1	2	2	2	2	2

i	1	2	3	4	5	6	7	8	9	10
`school_id`	1	1	1	2	2	4	4	5	5	5

	1	2	3	4	5	6	7	8	9	10
1	1	1	1	1	1	0	0	0	0	0
2	1	2	2	2	2	1	0	0	0	0
3	1	2	3	3	3	2	1	0	0	0
4	1	2	3	4	4	3	2	1	0	0
5	1	2	3	4	5	4	3	2	1	0
6	0	1	2	3	4	5	4	3	2	1
7	0	0	1	2	3	4	4	3	2	1
8	0	0	0	1	2	3	3	3	2	1
9	0	0	0	0	1	2	2	2	2	1
10	0	0	0	0	0	1	1	1	1	1

Enter password to continue

IID, one latent value per observation

IID, grouped (the hierarchical case)

IID, replicated (nrep independent copies of the same iid block)

IID + control.group (correlation across a second axis)

RW1 (ordered, neighbors are correlated)

The group key on its own (what does it actually do?)

RW1 cyclic (neighbors wrap around: position n ties back to position 1)

Precision matrix Q = τR: linear vs cyclic

IID with weights (per-observation design scaling)

IID with values (reserve slots for unobserved levels)

Seasonal (period m: windowed sums of length m are iid)

IID + `control.group` (correlation across a second axis)

The `group` key on its own (what does it actually do?)

IID with `weights` (per-observation design scaling)

IID with `values` (reserve slots for unobserved levels)

	1	2	3	4	5	6	7	8	9	10
1	1	1	1	1	1	0	0	0	0	0
2	1	2	2	2	2	1	0	0	0	0
3	1	2	3	3	3	2	1	0	0	0
4	1	2	3	4	4	3	2	1	0	0
5	1	2	3	4	5	4	3	2	1	0
6	0	1	2	3	4	5	4	3	2	1
7	0	0	1	2	3	4	4	3	2	1
8	0	0	0	1	2	3	3	3	2	1
9	0	0	0	0	1	2	2	2	2	1
10	0	0	0	0	0	1	1	1	1	1