Skip to content

Tags: FluxML/Flux.jl

Tags

v0.16.10

Toggle v0.16.10's commit message
[Diff since v0.16.9](v0.16.9...v0.16.10)

**Merged pull requests:**
- Bump julia-actions/cache from 2 to 3 (#2654) (@dependabot[bot])
- Add AGENTS.md with instructions for AI agents (#2655) (@CarloLucibello)
- Use ParallelTestRunner.jl for parallel test execution (#2656) (@CarloLucibello)
- Run Enzyme tests on julia 1.12 (#2659) (@CarloLucibello)
- Bump dorny/paths-filter from 3.0.2 to 4.0.1 (#2661) (@dependabot[bot])
- mark all documented methods as public (#2662) (@CarloLucibello)
- fix AutoZygote gradient (#2664) (@CarloLucibello)
- Use GPUArrays caching allocator in train! (#2665) (@CarloLucibello)
- Implement `_show_leaflike(::Tuple{})` to handle parameters consisting of an empty `Tuple` (#2667) (@JoshuaBillson)
- Bump codecov/codecov-action from 5 to 6 (#2668) (@dependabot[bot])
- Fix number formatting for PairwiseFusion (#2669) (@hardik-xi11)
- CompatHelper: bump compat for cuDNN in [weakdeps] to 6, (keep existing compat) (#2671) (@github-actions[bot])
- CompatHelper: bump compat for CUDA in [weakdeps] to 6, (keep existing compat) (#2672) (@github-actions[bot])

**Closed issues:**
- Incorrect link on docs (#2380)
- bad formatting for PairwiseFusion docstring (#2406)
- Use Caching Allocator in train! (#2636)
- all documented functions should be marked as `public` (#2638)
- Can't install flux. (#2651)
- GSoC 2026 Interest: RL Environments Project (#2663)

v0.16.9

Toggle v0.16.9's commit message
[Diff since v0.16.8](v0.16.8...v0.16.9)

**Merged pull requests:**
- Support Mooncake 0.5  (#2653) (@yebai)

v0.16.8

Toggle v0.16.8's commit message
[Diff since v0.16.7](v0.16.7...v0.16.8)

**Merged pull requests:**
- Enable other reactant tests (#2600) (@wsmoses)
- fix code in docs too wide (#2641) (@CarloLucibello)
- Set `early_stopping` default value for `init_score` to `Inf` (#2642) (@andreuvall)
- extend `gradient` to take an ADType argument (#2645) (@CarloLucibello)
- allow passing an ADType to train! (#2650) (@CarloLucibello)

**Closed issues:**
- Local Response Normalisation (#192)
- generic_matmul! hit in `back!` because type-promotion in activation function (#613)
- Layer Transpose (#635)
- add `trainstep!` (#666)
- Hypernetwork API (#797)
- Optimizer handling of infinite loss (#821)
- Flux Optimizers should define equality (#823)
- Model optimization fails (NaNs) with Zygote.pullback but works with Tracker.forward (#876)
- more issue labels? (#879)
- Diagonal does not return same size due to broadcast (#890)
- Integrate epochs within Flux.train! (#1058)
- Simplest linear model on housing data w/ Flux (#1122)
- OneHotVector(i, n) when i > n (#1300)
- Support DirectML (#1347)
- Feature request: Modifying Dense Layer to accommodate kernel/bias constraints and kernel/bias regularisation  (#1389)
- Flux.softmax returns wrong result with CuArray (#1425)
- Rethink train design and better callbacks support (#1461)
- tied weights (by transposition) are not tied when sent to gpu (#1504)
- Regularization example from docs can't be differentiated (#1588)
- Tied weights using Flux layers (#1592)
- Import Flux on worker crashes (#1625)
- Unclear wording in "Composing Optimizers" section of docs (#1627)
- Recurrent network interface updates/design (#1678)
- Please do not deprecate Dense(...;initb=...) (#1684)
- Triage Meetings (#1709)
- Differentiating a Model While Resetting the Parameters at every Epoch (#1766)
- BatchNorm on GPU without affine or tracking statistics (#1810)
- Float32 parameters in structs unsupported? (#1817)
-  using Flux -> InitError: IOError: mkdir("/pbs/software/centos-7-x86_64/julia/1.7.0/share/julia/packages/Flux/BPPNj/src/data/../../deps"; mode=0o777): read-only file system (EROFS) (#1839)
- Easy to make mistake with `gpu()` (#1887)
- Inline printing for OneHotArray is not GPU-friendly (#1905)
- Freezing layers at model construction time (#1931)
- Unable to precompile -- "allequal not defined" (#1934)
- Weird Side Effects of `loadparams!` (#1979)
- Taking serialization seriously (#1988)
- Issue with logitcrossentropy on onehotencoded input on GPU (#2002)
- Add default inner constructor to Dense (#2158)
- docs on freezing layers should be ported to the explicit syntax (#2216)
- Loading Flux 0.13.15 for the first time results in error (#2232)
- Default for `init_score` in `early_stopping` (#2639)
- Does not compile anymore in conjunction with CUDA due to dependency on `MLDataDevices` (#2647)

v0.16.7

Toggle v0.16.7's commit message
[Diff since v0.16.6](v0.16.6...v0.16.7)

**Merged pull requests:**
- fix cuda ext for real (#2637) (@CarloLucibello)

**Closed issues:**
- Docker images for Floydhub and similar (#148)
- Implement einsum function/macro à la PyTorch and TF (#297)
- Flux and Images (#326)
- "Tracing" memory pre-allocator (#349)
- make Juno dependency conditional (#454)
- Encoding array dimensions in flux type system? (#614)
- Gradient Interface Design (#628)
- New New Optimisers (#637)
- Clipping (#672)
- CUDA Programming Model (#706)
- LBFGS Optimizer (#719)
- Flux plots (#729)
- "ADAM" and friends should be called "Adam" (#795)
- Add lookahead optimizer (#838)
- ADAM does not accept keyword arguments (#871)
- Compatibility with Tracker (#883)
- Numerical issues for (logit)binarycrossentropy (#914)
- Change abstract argument names to meaningful ASCII (#915)
- Roadmap to Flux 1.0 (#961)
- Zygote gives extra gradient entries for BatchNorm  (#1018)
- Helper methods for extracting RNN final state in a GPU compatible way (#1043)
- helper function for selecting a gpu in multi-gpu setting (#1074)
- Provide iper-simple examples directly in readme.md (#1115)
- gpu function does nothing, but only on first run (#1119)
- Behavior of chunk (#1120)
- ArrayFire (#1126)
- MethodError: no method matching zero(::Type{Array{Float32,2}}) In Flux Loss function  (#1134)
- Parameter collection and GPU movement fail on models defined via functions (#1201)
- Derivative in loss function error (#1464)
- Document OneHotArray (#1519)
- Second order derivative  (#1582)
- `Conv` is not working for Complex when using CUDA  (#1655)
- Flux installation errors in julia 1.7.0-rc1, WSL2 (#1757)
- Two-arg update!(x, d) is never used (#1860)
- cpu() type stability (#1878)

v0.16.6

Toggle v0.16.6's commit message
[Diff since v0.16.5](v0.16.5...v0.16.6)

**Merged pull requests:**
- Bump actions/checkout from 4 to 5 (#2615) (@dependabot[bot])
- Fix Typos in Docs (#2619) (@Solanasol233)
- Fix reference to inner gradient structure in docs basics.md (#2622) (@AbelHo)
- Fix the "Gradients and Layers" documentation (#2628) (#2629) (@gomurin0428)
- Bump actions/checkout from 5 to 6 (#2630) (@dependabot[bot])
- fix cuda extension on julia 1.12 (#2632) (@CarloLucibello)
- fix CI (#2634) (@CarloLucibello)

**Closed issues:**
- Enzyme gradient example broken (#2554)
- [enzyme] broken Bilinear gradient (#2565)
- [enzyme] broken MultiHeadAttention gradient (#2567)
- Regression: `cpu` function is incompatible with DataFrames since Functors became opt-in (#2617)
- Flux on GPU is incompatible with NVIDIA driver version 13 (#2618)
- Views built within MLUtils.kfolds scalar indexing error on gpu (#2620)
- Simple single layer line fit does not converge for most sets of sample points (#2623)
- Julia 1.12: Deadlock detected in loading Flux ext (#2625)
- Enzyme Documentation example does not work on CPU (#2627)
- Minor documentation issue ("Gradients and Layers") (#2628)
- Deadlock when loading `FluxCUDAcuDNNExt` on Julia 1.12.2 (works on 1.11.7) (#2631)
- [Metal] error in forward pass with tanh activation (#2633)

v0.16.5

Toggle v0.16.5's commit message
[Diff since v0.16.4](v0.16.4...v0.16.5)

**Merged pull requests:**
- Fix Typos in Old Tutorials Documentation (#2610) (@leopardracer)
- CompatHelper: bump compat for AMDGPU in [weakdeps] to 2, (keep existing compat) (#2613) (@github-actions[bot])
- Bump to 0.16.5 (#2614) (@pxl-th)

**Closed issues:**
- `unsafe_free!` from MLDataDevices fails for CuArray{CartesianIndex{4}, 1, CUDA.DeviceMemory}) (#2612)

v0.16.4

Toggle v0.16.4's commit message
[Diff since v0.16.3](v0.16.3...v0.16.4)

**Merged pull requests:**
- Fix missing imports in FluxMPIExt (#2589) (@Alexander-Barth)
- Added shape validation for Conv weight tensor (#2590) (@mikymatt01)
- Disable broken Reactant tests for now (#2595) (@ToucheSir)
- Test re enable reactant test to identify failures (#2596) (@wsmoses)
- fix recurrent docstrings (#2597) (@CarloLucibello)
- Fix loading of pooling layers (#2598) (@adrhill)
- Fix typos (#2601) (@omahs)
- small updates to tests (#2602) (@CarloLucibello)
- reinstate enzyme tests (#2603) (@CarloLucibello)
- fix docs CI (#2604) (@CarloLucibello)
- fix piracy with DataLoader (#2608) (@CarloLucibello)

**Closed issues:**
- Reduce hcat creates dense matrix (#1596)
- Update GSoC 2025 Idea List (#2586)
- Type piracy breaks `(dev::AbstractDevice)(d::DataLoader)` (#2592)
- Dropout erroring on CUDA, when using `cu` but not `gpu` (#2594)

v0.16.3

Toggle v0.16.3's commit message
[Diff since v0.16.2](v0.16.2...v0.16.3)

**Merged pull requests:**
- fix `cpu(dataloader)` (#2587) (@CarloLucibello)

**Closed issues:**
- Data loading & preprocessing pipeline feature (#1282)
- Infinite time of gradient (#2585)

v0.16.2

Toggle v0.16.2's commit message
[Diff since v0.16.1](v0.16.1...v0.16.2)

**Merged pull requests:**
- Update deps & bump to 0.16.1 (#2574) (@pxl-th)

**Closed issues:**
- New Gradients ruin everything (#2580)
- Failure to precompile on 1.12: cannot declare Flux.destructure public; it is already declared exported (#2583)

v0.16.1

Toggle v0.16.1's commit message
[Diff since v0.16.0](v0.16.0...v0.16.1)

**Merged pull requests:**
- Adding RecurrentLayers to ecosystem.md (#2555) (@MartinuzziFrancesco)
- Fixed typo in recurrence documentation (#2556) (@MartinuzziFrancesco)
- Adding return state option to recurrent layers (#2557) (@MartinuzziFrancesco)
- update Schedulers docs (#2560) (@CarloLucibello)
- collapse doc string in layers docs (#2562) (@CarloLucibello)
- fix test enzyme (#2563) (@CarloLucibello)
- Remove 2 items from public, to fix 1.12 (#2569) (@mcabbott)
- Add reactant forward and reverse pass tests (#2576) (@wsmoses)
- cleanup Reactant and Enzyme tests (#2578) (@CarloLucibello)

**Closed issues:**
- cell output is not clearly distinguishable from the state (#2548)
- Flux.cpu and Flux.gpu no longer move data on views (#2553)
- remove usage example of old optimiser (#2558)
- Optimizing over `AbstractMatrix` subtypes (#2559)
- introduce a FlattenLayer (#2561)
- [enzyme] broken MeanPool gradient (#2564)
- [enzyme] broken BatchNorm gradient (#2566)
- [enyzme] broken recurrent cell loss (#2568)