
Better parameter learning and inductive types #191

Merged · 232 commits merged into main · Jul 19, 2024
Conversation

@rtjoa (Contributor) commented Jul 19, 2024

TL;DR

We add support for learning arbitrary objectives in terms of probabilistic queries. To find the value of θ that maximizes pr(flip(θ) & flip(θ) & !flip(θ)):

θ = Var("θ")
x = flip(θ) & flip(θ) & !flip(θ)
var_vals = Valuation(θ => 0.5)  # initial assignments
loss = -LogPr(x)
train!(var_vals, loss)  # mutates var_vals
@test compute_mixed(var_vals, θ) ≈ 2/3
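
(As a sanity check: the objective is θ²(1 − θ), whose derivative 2θ − 3θ² vanishes at θ = 2/3, matching the learned value.)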

Macros make it easy to work with probabilistic inductive types:

# Peano-style naturals: Z() is zero, S(n) is the successor of n.
@inductive Nat Z() S(Nat)
function Base.:(+)(x::Nat, y::Nat)
    @match y [
        Z() -> x,
        S(y′) -> S(x) + y′,  # x + S(y′) = S(x) + y′
    ]
end
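
As a quick usage sketch (the exact format of pr's output is an assumption here; see the tours for the canonical interface), addition on concrete Nats composes with prob_equals:

two, one = S(S(Z())), S(Z())
pr(prob_equals(two + one, S(S(S(Z())))))  # expect true => 1.0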

To ensure the documentation stays up to date, tours of the core of Dice.jl and of parameter learning have been added to the tests. I recommend looking at these first to get a sense of the interface.

Better parameter learning

We update autodiff to represent the log-probabilities of Dist{Bool}s symbolically, so that arbitrary loss functions can be trained rather than just MLE. In fact, we support "arbitrary interleavings": computation dependent on log-probabilities can be used as flip parameters, which can create more symbolic log-probabilities, and so on.

The core construct we add is the struct LogPr(::Dist{Bool}) <: ADNode.

  • To compute an ADNode containing a LogPr, use compute_mixed rather than compute.
  • To perform inference on a Dist containing a flip whose probability is dependent on a LogPr, use pr_mixed rather than pr.
  • train!(::Valuation, loss::ADNode; epochs, learning_rate) updates a valuation (a dict from Vars to values) to minimize loss by gradient descent

Examples and tests are given in test/autodiff_pr.
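
For instance, here is a sketch of a non-MLE objective, driving pr(x) toward a target value rather than maximizing it (the exp and ^ overloads on ADNodes are assumptions here):

θ = Var("θ")
x = flip(θ) & flip(θ)
var_vals = Valuation(θ => 0.1)
loss = (exp(LogPr(x)) - 0.5) ^ 2  # assumes exp and ^ are overloaded on ADNodes
train!(var_vals, loss; epochs=1000, learning_rate=0.1)
compute_mixed(var_vals, θ)  # expect ≈ sqrt(0.5), since pr(x) = θ²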

Other improvements

We add the functions sample_as_dist and frombits_as_dist, which work the same as their non-_as_dist counterparts, except they return deterministic Dists instead of Julia primitives (e.g. DistUInt32(3) instead of 3), allowing us to feed the results back into programs (e.g. passing them to prob_equals to check the probability of a particular sample/grounding).
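
A hypothetical round-trip along these lines (the argument conventions and the Dist-level ifelse overload are assumptions; exact signatures may differ):

x = ifelse(flip(0.9), DistUInt32(3), DistUInt32(7))  # assumed Dist-level ifelse
s = sample_as_dist(x)   # a deterministic Dist, e.g. DistUInt32(3)
pr(prob_equals(x, s))   # probability of re-drawing that particular sample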

Future work

Ideally, parameter learning would integrate with a dedicated AD library like Zygote.jl. However, making that play well with CUDD requires care, and we already have our own tiny autodiff framework, so keeping it does not add much complexity to this PR.

@rtjoa rtjoa marked this pull request as ready for review July 19, 2024 03:29
@rtjoa merged commit dcd3406 into main Jul 19, 2024
3 checks passed