Tags: FluxML/Flux.jl
[Diff since v0.16.9](v0.16.9...v0.16.10)

**Merged pull requests:**
- Bump julia-actions/cache from 2 to 3 (#2654) (@dependabot[bot])
- Add AGENTS.md with instructions for AI agents (#2655) (@CarloLucibello)
- Use ParallelTestRunner.jl for parallel test execution (#2656) (@CarloLucibello)
- Run Enzyme tests on julia 1.12 (#2659) (@CarloLucibello)
- Bump dorny/paths-filter from 3.0.2 to 4.0.1 (#2661) (@dependabot[bot])
- mark all documented methods as public (#2662) (@CarloLucibello)
- fix AutoZygote gradient (#2664) (@CarloLucibello)
- Use GPUArrays caching allocator in train! (#2665) (@CarloLucibello)
- Implement `_show_leaflike(::Tuple{})` to handle parameters consisting of an empty `Tuple` (#2667) (@JoshuaBillson)
- Bump codecov/codecov-action from 5 to 6 (#2668) (@dependabot[bot])
- Fix number formatting for PairwiseFusion (#2669) (@hardik-xi11)
- CompatHelper: bump compat for cuDNN in [weakdeps] to 6, (keep existing compat) (#2671) (@github-actions[bot])
- CompatHelper: bump compat for CUDA in [weakdeps] to 6, (keep existing compat) (#2672) (@github-actions[bot])

**Closed issues:**
- Incorrect link on docs (#2380)
- bad formatting for PairwiseFusion docstring (#2406)
- Use Caching Allocator in train! (#2636)
- all documented functions should be marked as `public` (#2638)
- Can't install flux. (#2651)
- GSoC 2026 Interest: RL Environments Project (#2663)

[Diff since v0.16.7](v0.16.7...v0.16.8)

**Merged pull requests:**
- Enable other reactant tests (#2600) (@wsmoses)
- fix code in docs too wide (#2641) (@CarloLucibello)
- Set `early_stopping` default value for `init_score` to `Inf` (#2642) (@andreuvall)
- extend `gradient` to take an ADType argument (#2645) (@CarloLucibello)
- allow passing an ADType to train! (#2650) (@CarloLucibello)

**Closed issues:**
- Local Response Normalisation (#192)
- generic_matmul! hit in `back!` because type-promotion in activation function (#613)
- Layer Transpose (#635)
- add `trainstep!` (#666)
- Hypernetwork API (#797)
- Optimizer handling of infinite loss (#821)
- Flux Optimizers should define equality (#823)
- Model optimization fails (NaNs) with Zygote.pullback but works with Tracker.forward (#876)
- more issue labels? (#879)
- Diagonal does not return same size due to broadcast (#890)
- Integrate epochs within Flux.train! (#1058)
- Simplest linear model on housing data w/ Flux (#1122)
- OneHotVector(i, n) when i > n (#1300)
- Support DirectML (#1347)
- Feature request: Modifying Dense Layer to accommodate kernel/bias constraints and kernel/bias regularisation (#1389)
- Flux.softmax returns wrong result with CuArray (#1425)
- Rethink train design and better callbacks support (#1461)
- tied weights (by transposition) are not tied when sent to gpu (#1504)
- Regularization example from docs can't be differentiated (#1588)
- Tied weights using Flux layers (#1592)
- Import Flux on worker crashes (#1625)
- Unclear wording in "Composing Optimizers" section of docs (#1627)
- Recurrent network interface updates/design (#1678)
- Please do not deprecate Dense(...;initb=...) (#1684)
- Triage Meetings (#1709)
- Differentiating a Model While Resetting the Parameters at every Epoch (#1766)
- BatchNorm on GPU without affine or tracking statistics (#1810)
- Float32 parameters in structs unsupported? (#1817)
- using Flux -> InitError: IOError: mkdir("/pbs/software/centos-7-x86_64/julia/1.7.0/share/julia/packages/Flux/BPPNj/src/data/../../deps"; mode=0o777): read-only file system (EROFS) (#1839)
- Easy to make mistake with `gpu()` (#1887)
- Inline printing for OneHotArray is not GPU-friendly (#1905)
- Freezing layers at model construction time (#1931)
- Unable to precompile -- "allequal not defined" (#1934)
- Weird Side Effects of `loadparams!` (#1979)
- Taking serialization seriously (#1988)
- Issue with logitcrossentropy on onehotencoded input on GPU (#2002)
- Add default inner constructor to Dense (#2158)
- docs on freezing layers should be ported to the explicit syntax (#2216)
- Loading Flux 0.13.15 for the first time results in error (#2232)
- Default for `init_score` in `early_stopping` (#2639)
- Does not compile anymore in conjunction with CUDA due to dependency on `MLDataDevices` (#2647)

[Diff since v0.16.6](v0.16.6...v0.16.7)

**Merged pull requests:**
- fix cuda ext for real (#2637) (@CarloLucibello)

**Closed issues:**
- Docker images for Floydhub and similar (#148)
- Implement einsum function/macro à la PyTorch and TF (#297)
- Flux and Images (#326)
- "Tracing" memory pre-allocator (#349)
- make Juno dependency conditional (#454)
- Encoding array dimensions in flux type system? (#614)
- Gradient Interface Design (#628)
- New New Optimisers (#637)
- Clipping (#672)
- CUDA Programming Model (#706)
- LBFGS Optimizer (#719)
- Flux plots (#729)
- "ADAM" and friends should be called "Adam" (#795)
- Add lookahead optimizer (#838)
- ADAM does not accept keyword arguments (#871)
- Compatibility with Tracker (#883)
- Numerical issues for (logit)binarycrossentropy (#914)
- Change abstract argument names to meaningful ASCII (#915)
- Roadmap to Flux 1.0 (#961)
- Zygote gives extra gradient entries for BatchNorm (#1018)
- Helper methods for extracting RNN final state in a GPU compatible way (#1043)
- helper function for selecting a gpu in multi-gpu setting (#1074)
- Provide iper-simple examples directly in readme.md (#1115)
- gpu function does nothing, but only on first run (#1119)
- Behavior of chunk (#1120)
- ArrayFire (#1126)
- MethodError: no method matching zero(::Type{Array{Float32,2}}) In Flux Loss function (#1134)
- Parameter collection and GPU movement fail on models defined via functions (#1201)
- Derivative in loss function error (#1464)
- Document OneHotArray (#1519)
- Second order derivative (#1582)
- `Conv` is not working for Complex when using CUDA (#1655)
- Flux installation errors in julia 1.7.0-rc1, WSL2 (#1757)
- Two-arg update!(x, d) is never used (#1860)
- cpu() type stability (#1878)

[Diff since v0.16.5](v0.16.5...v0.16.6)

**Merged pull requests:**
- Bump actions/checkout from 4 to 5 (#2615) (@dependabot[bot])
- Fix Typos in Docs (#2619) (@Solanasol233)
- Fix reference to inner gradient structure in docs basics.md (#2622) (@AbelHo)
- Fix the "Gradients and Layers" documentation (#2628) (#2629) (@gomurin0428)
- Bump actions/checkout from 5 to 6 (#2630) (@dependabot[bot])
- fix cuda extension on julia 1.12 (#2632) (@CarloLucibello)
- fix CI (#2634) (@CarloLucibello)

**Closed issues:**
- Enzyme gradient example broken (#2554)
- [enzyme] broken Bilinear gradient (#2565)
- [enzyme] broken MultiHeadAttention gradient (#2567)
- Regression: `cpu` function is incompatible with DataFrames since Functors became opt-in (#2617)
- Flux on GPU is incompatible with NVIDIA driver version 13 (#2618)
- Views built within MLUtils.kfolds scalar indexing error on gpu (#2620)
- Simple single layer line fit does not converge for most sets of sample points (#2623)
- Julia 1.12: Deadlock detected in loading Flux ext (#2625)
- Enzyme Documentation example does not work on CPU (#2627)
- Minor documentation issue ("Gradients and Layers") (#2628)
- Deadlock when loading `FluxCUDAcuDNNExt` on Julia 1.12.2 (works on 1.11.7) (#2631)
- [Metal] error in forward pass with tanh activation (#2633)

[Diff since v0.16.4](v0.16.4...v0.16.5)

**Merged pull requests:**
- Fix Typos in Old Tutorials Documentation (#2610) (@leopardracer)
- CompatHelper: bump compat for AMDGPU in [weakdeps] to 2, (keep existing compat) (#2613) (@github-actions[bot])
- Bump to 0.16.5 (#2614) (@pxl-th)

**Closed issues:**
- `unsafe_free!` from MLDataDevices fails for CuArray{CartesianIndex{4}, 1, CUDA.DeviceMemory}) (#2612)

[Diff since v0.16.3](v0.16.3...v0.16.4)

**Merged pull requests:**
- Fix missing imports in FluxMPIExt (#2589) (@Alexander-Barth)
- Added shape validation for Conv weight tensor (#2590) (@mikymatt01)
- Disable broken Reactant tests for now (#2595) (@ToucheSir)
- Test re enable reactant test to identify failures (#2596) (@wsmoses)
- fix recurrent docstrings (#2597) (@CarloLucibello)
- Fix loading of pooling layers (#2598) (@adrhill)
- Fix typos (#2601) (@omahs)
- small updates to tests (#2602) (@CarloLucibello)
- reinstate enzyme tests (#2603) (@CarloLucibello)
- fix docs CI (#2604) (@CarloLucibello)
- fix piracy with DataLoader (#2608) (@CarloLucibello)

**Closed issues:**
- Reduce hcat creates dense matrix (#1596)
- Update GSoC 2025 Idea List (#2586)
- Type piracy breaks `(dev::AbstractDevice)(d::DataLoader)` (#2592)
- Dropout erroring on CUDA, when using `cu` but not `gpu` (#2594)

[Diff since v0.16.2](v0.16.2...v0.16.3)

**Merged pull requests:**
- fix `cpu(dataloader)` (#2587) (@CarloLucibello)

**Closed issues:**
- Data loading & preprocessing pipeline feature (#1282)
- Infinite time of gradient (#2585)

[Diff since v0.16.1](v0.16.1...v0.16.2)

**Merged pull requests:**
- Update deps & bump to 0.16.1 (#2574) (@pxl-th)

**Closed issues:**
- New Gradients ruin everything (#2580)
- Failure to precompile on 1.12: cannot declare Flux.destructure public; it is already declared exported (#2583)

[Diff since v0.16.0](v0.16.0...v0.16.1)

**Merged pull requests:**
- Adding RecurrentLayers to ecosystem.md (#2555) (@MartinuzziFrancesco)
- Fixed typo in recurrence documentation (#2556) (@MartinuzziFrancesco)
- Adding return state option to recurrent layers (#2557) (@MartinuzziFrancesco)
- update Schedulers docs (#2560) (@CarloLucibello)
- collapse doc string in layers docs (#2562) (@CarloLucibello)
- fix test enzyme (#2563) (@CarloLucibello)
- Remove 2 items from public, to fix 1.12 (#2569) (@mcabbott)
- Add reactant forward and reverse pass tests (#2576) (@wsmoses)
- cleanup Reactant and Enzyme tests (#2578) (@CarloLucibello)

**Closed issues:**
- cell output is not clearly distinguishable from the state (#2548)
- Flux.cpu and Flux.gpu no longer move data on views (#2553)
- remove usage example of old optimiser (#2558)
- Optimizing over `AbstractMatrix` subtypes (#2559)
- introduce a FlattenLayer (#2561)
- [enzyme] broken MeanPool gradient (#2564)
- [enzyme] broken BatchNorm gradient (#2566)
- [enyzme] broken recurrent cell loss (#2568)
