Tags · FluxML/Flux.jl

v0.16.10

[Diff since v0.16.9](v0.16.9...v0.16.10)

**Merged pull requests:**
- Bump julia-actions/cache from 2 to 3 (#2654) (@dependabot[bot])
- Add AGENTS.md with instructions for AI agents (#2655) (@CarloLucibello)
- Use ParallelTestRunner.jl for parallel test execution (#2656) (@CarloLucibello)
- Run Enzyme tests on julia 1.12 (#2659) (@CarloLucibello)
- Bump dorny/paths-filter from 3.0.2 to 4.0.1 (#2661) (@dependabot[bot])
- mark all documented methods as public (#2662) (@CarloLucibello)
- fix AutoZygote gradient (#2664) (@CarloLucibello)
- Use GPUArrays caching allocator in train! (#2665) (@CarloLucibello)
- Implement `_show_leaflike(::Tuple{})` to handle parameters consisting of an empty `Tuple` (#2667) (@JoshuaBillson)
- Bump codecov/codecov-action from 5 to 6 (#2668) (@dependabot[bot])
- Fix number formatting for PairwiseFusion (#2669) (@hardik-xi11)
- CompatHelper: bump compat for cuDNN in [weakdeps] to 6, (keep existing compat) (#2671) (@github-actions[bot])
- CompatHelper: bump compat for CUDA in [weakdeps] to 6, (keep existing compat) (#2672) (@github-actions[bot])

**Closed issues:**
- Incorrect link on docs (#2380)
- bad formatting for PairwiseFusion docstring (#2406)
- Use Caching Allocator in train! (#2636)
- all documented functions should be marked as `public` (#2638)
- Can't install flux. (#2651)
- GSoC 2026 Interest: RL Environments Project (#2663)

Apr 17, 2026
a2c6e5c

v0.16.9

[Diff since v0.16.8](v0.16.8...v0.16.9)

**Merged pull requests:**
- Support Mooncake 0.5 (#2653) (@yebai)

Feb 1, 2026
ce4b8a0

v0.16.8

[Diff since v0.16.7](v0.16.7...v0.16.8)

**Merged pull requests:**
- Enable other reactant tests (#2600) (@wsmoses)
- fix code in docs too wide (#2641) (@CarloLucibello)
- Set `early_stopping` default value for `init_score` to `Inf` (#2642) (@andreuvall)
- extend `gradient` to take an ADType argument (#2645) (@CarloLucibello)
- allow passing an ADType to train! (#2650) (@CarloLucibello)

**Closed issues:**
- Local Response Normalisation (#192)
- generic_matmul! hit in `back!` because type-promotion in activation function (#613)
- Layer Transpose (#635)
- add `trainstep!` (#666)
- Hypernetwork API (#797)
- Optimizer handling of infinite loss (#821)
- Flux Optimizers should define equality (#823)
- Model optimization fails (NaNs) with Zygote.pullback but works with Tracker.forward (#876)
- more issue labels? (#879)
- Diagonal does not return same size due to broadcast (#890)
- Integrate epochs within Flux.train! (#1058)
- Simplest linear model on housing data w/ Flux (#1122)
- OneHotVector(i, n) when i > n (#1300)
- Support DirectML (#1347)
- Feature request: Modifying Dense Layer to accommodate kernel/bias constraints and kernel/bias regularisation  (#1389)
- Flux.softmax returns wrong result with CuArray (#1425)
- Rethink train design and better callbacks support (#1461)
- tied weights (by transposition) are not tied when sent to gpu (#1504)
- Regularization example from docs can't be differentiated (#1588)
- Tied weights using Flux layers (#1592)
- Import Flux on worker crashes (#1625)
- Unclear wording in "Composing Optimizers" section of docs (#1627)
- Recurrent network interface updates/design (#1678)
- Please do not deprecate Dense(...;initb=...) (#1684)
- Triage Meetings (#1709)
- Differentiating a Model While Resetting the Parameters at every Epoch (#1766)
- BatchNorm on GPU without affine or tracking statistics (#1810)
- Float32 parameters in structs unsupported? (#1817)
- using Flux -> InitError: IOError: mkdir("/pbs/software/centos-7-x86_64/julia/1.7.0/share/julia/packages/Flux/BPPNj/src/data/../../deps"; mode=0o777): read-only file system (EROFS) (#1839)
- Easy to make mistake with `gpu()` (#1887)
- Inline printing for OneHotArray is not GPU-friendly (#1905)
- Freezing layers at model construction time (#1931)
- Unable to precompile -- "allequal not defined" (#1934)
- Weird Side Effects of `loadparams!` (#1979)
- Taking serialization seriously (#1988)
- Issue with logitcrossentropy on onehotencoded input on GPU (#2002)
- Add default inner constructor to Dense (#2158)
- docs on freezing layers should be ported to the explicit syntax (#2216)
- Loading Flux 0.13.15 for the first time results in error (#2232)
- Default for `init_score` in `early_stopping` (#2639)
- Does not compile anymore in conjunction with CUDA due to dependency on `MLDataDevices` (#2647)

Jan 23, 2026
d15c7dc

v0.16.7

[Diff since v0.16.6](v0.16.6...v0.16.7)

**Merged pull requests:**
- fix cuda ext for real (#2637) (@CarloLucibello)

**Closed issues:**
- Docker images for Floydhub and similar (#148)
- Implement einsum function/macro à la PyTorch and TF (#297)
- Flux and Images (#326)
- "Tracing" memory pre-allocator (#349)
- make Juno dependency conditional (#454)
- Encoding array dimensions in flux type system? (#614)
- Gradient Interface Design (#628)
- New New Optimisers (#637)
- Clipping (#672)
- CUDA Programming Model (#706)
- LBFGS Optimizer (#719)
- Flux plots (#729)
- "ADAM" and friends should be called "Adam" (#795)
- Add lookahead optimizer (#838)
- ADAM does not accept keyword arguments (#871)
- Compatibility with Tracker (#883)
- Numerical issues for (logit)binarycrossentropy (#914)
- Change abstract argument names to meaningful ASCII (#915)
- Roadmap to Flux 1.0 (#961)
- Zygote gives extra gradient entries for BatchNorm  (#1018)
- Helper methods for extracting RNN final state in a GPU compatible way (#1043)
- helper function for selecting a gpu in multi-gpu setting (#1074)
- Provide iper-simple examples directly in readme.md (#1115)
- gpu function does nothing, but only on first run (#1119)
- Behavior of chunk (#1120)
- ArrayFire (#1126)
- MethodError: no method matching zero(::Type{Array{Float32,2}}) In Flux Loss function  (#1134)
- Parameter collection and GPU movement fail on models defined via functions (#1201)
- Derivative in loss function error (#1464)
- Document OneHotArray (#1519)
- Second order derivative  (#1582)
- `Conv` is not working for Complex when using CUDA  (#1655)
- Flux installation errors in julia 1.7.0-rc1, WSL2 (#1757)
- Two-arg update!(x, d) is never used (#1860)
- cpu() type stability (#1878)

Dec 9, 2025
64c3979

v0.16.6

[Diff since v0.16.5](v0.16.5...v0.16.6)

**Merged pull requests:**
- Bump actions/checkout from 4 to 5 (#2615) (@dependabot[bot])
- Fix Typos in Docs (#2619) (@Solanasol233)
- Fix reference to inner gradient structure in docs basics.md (#2622) (@AbelHo)
- Fix the "Gradients and Layers" documentation (#2628) (#2629) (@gomurin0428)
- Bump actions/checkout from 5 to 6 (#2630) (@dependabot[bot])
- fix cuda extension on julia 1.12 (#2632) (@CarloLucibello)
- fix CI (#2634) (@CarloLucibello)

**Closed issues:**
- Enzyme gradient example broken (#2554)
- [enzyme] broken Bilinear gradient (#2565)
- [enzyme] broken MultiHeadAttention gradient (#2567)
- Regression: `cpu` function is incompatible with DataFrames since Functors became opt-in (#2617)
- Flux on GPU is incompatible with NVIDIA driver version 13 (#2618)
- Views built within MLUtils.kfolds scalar indexing error on gpu (#2620)
- Simple single layer line fit does not converge for most sets of sample points (#2623)
- Julia 1.12: Deadlock detected in loading Flux ext (#2625)
- Enzyme Documentation example does not work on CPU (#2627)
- Minor documentation issue ("Gradients and Layers") (#2628)
- Deadlock when loading `FluxCUDAcuDNNExt` on Julia 1.12.2 (works on 1.11.7) (#2631)
- [Metal] error in forward pass with tanh activation (#2633)

Dec 8, 2025
19534df

v0.16.5

[Diff since v0.16.4](v0.16.4...v0.16.5)

**Merged pull requests:**
- Fix Typos in Old Tutorials Documentation (#2610) (@leopardracer)
- CompatHelper: bump compat for AMDGPU in [weakdeps] to 2, (keep existing compat) (#2613) (@github-actions[bot])
- Bump to 0.16.5 (#2614) (@pxl-th)

**Closed issues:**
- `unsafe_free!` from MLDataDevices fails for CuArray{CartesianIndex{4}, 1, CUDA.DeviceMemory}) (#2612)

Jul 23, 2025
461a1b6

v0.16.4

[Diff since v0.16.3](v0.16.3...v0.16.4)

**Merged pull requests:**
- Fix missing imports in FluxMPIExt (#2589) (@Alexander-Barth)
- Added shape validation for Conv weight tensor (#2590) (@mikymatt01)
- Disable broken Reactant tests for now (#2595) (@ToucheSir)
- Test re enable reactant test to identify failures (#2596) (@wsmoses)
- fix recurrent docstrings (#2597) (@CarloLucibello)
- Fix loading of pooling layers (#2598) (@adrhill)
- Fix typos (#2601) (@omahs)
- small updates to tests (#2602) (@CarloLucibello)
- reinstate enzyme tests (#2603) (@CarloLucibello)
- fix docs CI (#2604) (@CarloLucibello)
- fix piracy with DataLoader (#2608) (@CarloLucibello)

**Closed issues:**
- Reduce hcat creates dense matrix (#1596)
- Update GSoC 2025 Idea List (#2586)
- Type piracy breaks `(dev::AbstractDevice)(d::DataLoader)` (#2592)
- Dropout erroring on CUDA, when using `cu` but not `gpu` (#2594)

Jun 2, 2025
676c816

v0.16.3

[Diff since v0.16.2](v0.16.2...v0.16.3)

**Merged pull requests:**
- fix `cpu(dataloader)` (#2587) (@CarloLucibello)

**Closed issues:**
- Data loading & preprocessing pipeline feature (#1282)
- Infinite time of gradient (#2585)

Feb 6, 2025
9147e84

v0.16.2

[Diff since v0.16.1](v0.16.1...v0.16.2)

**Merged pull requests:**
- Update deps & bump to 0.16.1 (#2574) (@pxl-th)

**Closed issues:**
- New Gradients ruin everything (#2580)
- Failure to precompile on 1.12: cannot declare Flux.destructure public; it is already declared exported (#2583)

Jan 21, 2025
009d984

v0.16.1

[Diff since v0.16.0](v0.16.0...v0.16.1)

**Merged pull requests:**
- Adding RecurrentLayers to ecosystem.md (#2555) (@MartinuzziFrancesco)
- Fixed typo in recurrence documentation (#2556) (@MartinuzziFrancesco)
- Adding return state option to recurrent layers (#2557) (@MartinuzziFrancesco)
- update Schedulers docs (#2560) (@CarloLucibello)
- collapse doc string in layers docs (#2562) (@CarloLucibello)
- fix test enzyme (#2563) (@CarloLucibello)
- Remove 2 items from public, to fix 1.12 (#2569) (@mcabbott)
- Add reactant forward and reverse pass tests (#2576) (@wsmoses)
- cleanup Reactant and Enzyme tests (#2578) (@CarloLucibello)

**Closed issues:**
- cell output is not clearly distinguishable from the state (#2548)
- Flux.cpu and Flux.gpu no longer move data on views (#2553)
- remove usage example of old optimiser (#2558)
- Optimizing over `AbstractMatrix` subtypes (#2559)
- introduce a FlattenLayer (#2561)
- [enzyme] broken MeanPool gradient (#2564)
- [enzyme] broken BatchNorm gradient (#2566)
- [enyzme] broken recurrent cell loss (#2568)

Jan 13, 2025
44695a0