Commit a7cfcd4
Release v0.2.1 (#161)
This release includes improvements to the tiling and DMA code
generation, new networks and operators, improved CI workflows, migration
to PyTest, and support for PyPI package releases.
Note: This release references the Docker image tagged with the release
tag (ghcr.io/pulp-platform/deeploy:v0.2.1), which does not exist yet, so
the CI will initially fail. After the release PR is merged, the Deeploy
Docker image must be built and the CI restarted.
### List of Pull Requests
- PyPi Package Deployment + Remove Banshee Dept
[#154](#154)
- PyTest Migration
[#144](#144)
- Update submodule `pulp-nn-mixed`
[#145](#145)
- Improve Profiling
[#138](#138)
- FP32 ReduceMean operator improvement
[#137](#137)
- Support for RMSNorm (Pow and Sqrt operators)
[#136](#136)
- Demo TinyViT compatibility with tiled Siracusa
[#124](#124)
- TinyViT on non-tiled Siracusa
[#117](#117)
- Support Fully Asynchronous DMAs
[#114](#114)
- Disallow shape inference
[#128](#128)
- Remove memory-aware node bindings
[#123](#123)
- Fix missing const's layout transformation and refactor NCHWtoNHWC
passes [#122](#122)
- Fix aliasing [#125](#125)
- Support for 1D Autoencoder
[#98](#98)
- Refactor Logging for Improved Debugging
[#115](#115)
- Add reuse-tool as an SPDX license header linter
[#113](#113)
- Bug fixes, API Cleanup and Reduce Compiler Warning on PULP
[#112](#112)
- Fix PULP GEMM `batch` serialization
[#109](#109)
- Split CI Workflows by Platform and Task, Improve Formatting and
Linting Reliability
[#108](#108)
- Refactor tiling code generation
[#105](#105)
- Change order of typeMatching entries
[#68](#68)
- Node Mangling to avoid duplication
[#93](#93)
- Prepare Post v0.2.0 Release
[#104](#104)
- Use Docker digests instead of arch-specific tags
[#106](#106)
- Fix `Unsqueeze` Op. when using ONNX opset 13 or higher (from attribute
to input) [#119](#119)
- Fix bias hoisting in generic GEMM with no bias
[#126](#126)

1,212 files changed
Lines changed: 15740 additions & 21169 deletions
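
One item in the list above, the `Unsqueeze` fix (#119), refers to a change in the ONNX operator definition: before opset 13, `axes` is a node attribute; from opset 13 onward it is the second input. The sketch below illustrates that distinction only. The `Node` class, the `unsqueeze_axes` helper, and the `constants` lookup are hypothetical stand-ins for illustration, not Deeploy's or ONNX's actual API.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Sequence


@dataclass
class Node:
    """Hypothetical, simplified stand-in for an ONNX node."""
    op_type: str
    inputs: List[str]
    attrs: Dict[str, Sequence[int]] = field(default_factory=dict)


def unsqueeze_axes(node: Node, opset: int,
                   constants: Dict[str, Sequence[int]]) -> List[int]:
    """Return the axes of an Unsqueeze node for the given opset version."""
    if node.op_type != "Unsqueeze":
        raise ValueError(f"expected Unsqueeze, got {node.op_type}")
    if opset >= 13:
        # Opset 13+: `axes` is the second input; this sketch assumes it
        # resolves to a known constant tensor.
        if len(node.inputs) < 2 or node.inputs[1] not in constants:
            raise ValueError("opset >= 13 requires a constant 'axes' input")
        return list(constants[node.inputs[1]])
    # Opset < 13: `axes` is a node attribute.
    if "axes" not in node.attrs:
        raise ValueError("opset < 13 requires an 'axes' attribute")
    return list(node.attrs["axes"])


# The same operation expressed in both styles.
old_style = Node("Unsqueeze", ["x"], {"axes": [0, 2]})
new_style = Node("Unsqueeze", ["x", "axes_const"])

axes_old = unsqueeze_axes(old_style, opset=11, constants={})
axes_new = unsqueeze_axes(new_style, opset=13,
                          constants={"axes_const": [0, 2]})
print(axes_old, axes_new)
```

A parser that only reads the attribute would find no axes on the opset-13 node, which is the kind of failure a fix like this has to address.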
File tree
- .github
- workflows
- .gitlab
- issue_templates
- merge_request_templates
- .vscode
- Container
- DeeployTest
- Platforms
- Chimera
- Generic
- MemPool
- PULPOpen
- inc
- src
- QEMU_ARM
- inc
- src
- Siracusa
- inc
- src
- Snitch
- SoftHier
- Tests
- Adder
- Kernels
- FP32
- Add
- Large
- Regular
- Conv
- DW_2D_Bias
- DW_2D_NoBias
- DW_2D_ZeroValuedBias
- Regular_2D_Bias
- Regular_2D_NoBias
- Regular_2D_ZeroValuedBias
- Div
- GELU
- GEMM
- NoBias
- Regular
- TransB
- LayerNorm
- MatMul
- MaxPool
- Mul
- Pad
- Pow
- Scalar
- Vector
- RMSNorm
- ReLU
- ReduceMean
- KeepDims
- Add_ReduceMean_Add
- Add_ReduceMean
- AllAxes
- Axes1_2_3
- Axes1_3
- Axes2_1
- Axis0
- Axis2
- ReduceMean_Add
- NoKeepDims
- Add_ReduceMean_Add
- Add_ReduceMean
- AllAxes
- Axes1_2_3
- Axes1_3
- Axes2_1
- Axis0
- Axis2
- ReduceMean_Add
- ReduceSum
- Reshape
- Regular
- SkipConnection
- Softmax
- CrossEntropyGrad
- CrossEntropy
- Grad
- Regular
- Sqrt
- Squeeze
- Transpose
- Integer
- Add
- Large
- MultIO
- Regular_RQ
- Regular
- Attention
- Concat
- Conv
- DW_1D
- DW_2D_RQ
- DW_2D
- PW_2D_RQ
- Regular_RQ
- Unsigned_RQ
- PW_2D
- Regular_1D
- Regular_2D_RQ
- Regular_2D
- StriddedPadded_2D_RQ
- GEMM
- Batch_RQ
- Regular_RQPerColumn
- Regular_RQPerRow
- Regular
- TransB_RQ
- Hardswish
- Regular_RQ
- Regular
- Identity
- InputIdentity
- MultiOutputIdentity
- OutputIdentity
- MatMul
- Add
- Batch
- Regular_RQ
- Regular
- MaxPool
- Pad
- Regular_1D
- Regular_2D
- RMSNorm
- ReduceMean
- ReduceSum
- Slice
- Softmax
- Large
- Regular
- TrueIntegerDiv
- iNoNorm
- Mixed
- Dequant
- Quant
- Models
- Autoencoder1D
- CCT_Train
- CCT2_FT1
- CCT2_FT2
- CCT2_LP
- CCT2_LoRA1
- CCT2_LoRA2
- CCT
- FP32
- CCT_1_16_16_128
- CCT_1_16_16_32
- CCT_1_16_16_64
- CCT_1_16_16_8
- CCT_1_32_32_32
- CCT_1_32_32_8
- CCT_2_32_32_128_Opset20
- CCT_2_32_32_128
- Int
- ICCT_8
- ICCT_ITA_8
- ICCT_ITA
- ICCT
- CNN_Linear1
- CNN_Linear2
- EEGFormer
- MLPerf
- AnomalyDetection
- ImageClassification
- KeywordSpotting
- VisualWakeWords
- MobileNetv2
- TinyViT
- 5M/Layers/FP32/ReduceMean
- Demo
- Transformer_DeepQuant
- Transformer_Train
- Transformer
- WaveFormer
- microLlama
- microLlama128
- microLlama16_parallel
- microLlama16
- microLlama1_parallel
- microLlama1
- microLlama256
- microLlama2_parallel
- microLlama2
- microLlama32_parallel
- microLlama32
- microLlama4_parallel
- microLlama4
- microLlama64_parallel
- microLlama64
- microLlama8_parallel
- microLlama8
- miniMobileNetv2
- miniMobileNet
- Others
- Backtracking
- TypeInference
- test2DRequantizedConv
- testFloatReshapeWithSkipConnection
- testTrainCCT/CCT1_Classifier_Training
- CCT_1_16_16_128
- CCT_1_16_16_16
- CCT_1_16_16_32
- CCT_1_16_16_64
- CCT_1_16_16_8
- testUtils
- core
- Deeploy
- CommonExtensions
- CodeTransformationPasses
- NetworkDeployers
- OptimizationPasses
- BindingsOptimizationPasses
- TopologyOptimizationPasses
- TypeCheckers
- EngineExtension
- NetworkDeployers
- OptimizationPasses
- TopologyOptimizationPasses
- FutureExtension
- Bindings
- CodeTransformationPasses
- MemoryLevelExtension
- NetworkDeployers
- OptimizationPasses
- Targets
- Chimera
- Templates
- CortexM
- Templates
- TopologyOptimizationPasses
- Generic
- Templates
- TileConstraints
- TopologyOptimizationPasses
- MemPool
- Templates
- TopologyOptimizationPasses
- Neureka
- Templates
- TileConstraints
- TopologyOptimizationPasses
- PULPOpen
- CodeTransformationPasses
- DMA
- Templates
- TileConstraints
- TopologyOptimizationPasses
- Snitch
- CodeTransformationPasses
- DMA
- Templates
- TileConstraints
- SoftHier
- Templates
- TilingExtension
- CodeTransformationPasses
- LICENSES
- TargetLibraries
- CMSIS
- inc
- src
- Chimera
- inc
- src
- Generic
- inc
- kernel
- src
- MemPool
- cmake
- inc
- kernel
- src
- PULPOpen
- cmake
- inc
- kernel
- src
- third_party
- Snitch
- cmake
- inc
- kernel
- src
- SoftHier
- cmake
- inc
- src
- cmake
- chimera
- cmsis
- generic
- mempool
- pulp
- pulp-open
- siracusa
- snitch
- snitch_cluster
- softhier
- docs
- _templates
- tutorials
- scripts
- toolchain