Releases: apache/singa
Apache SINGA 3.0.0
3.0.0.rc1
This release includes following changes:
-
Code quality has been promoted by introducing linting check in CI and auto code formatter.
For linting, the tools,cpplint
andpylint
, are used and configured to comply
google coding styles details intool/linting/
.
Similarly, formatting tools,clang-format
andyapf
configured with google coding styles,
are the recommended one for developers to clean code before submitting changes,
details intool/code-format/
. LGTM is enabled on Github for
code quality check; License check is also enabled. -
New Tensor APIs are added for naming consistency, and feature enhancement:
- size(), mem_size(), get_value(), to_proto(), l1(), l2(): added for the sake of naming consistency
- AsType(): convert data type between
float
andint
- ceil(): perform element-wise ceiling of the input
- concat(): concatenate two tensor
- index selector: e.g. tensor1[:,:,1:,1:]
- softmax(in, axis): allow to perform softmax on a axis on a multi-dimensional tensor
-
14 new operators are added into the autograd module: Gemm, GlobalAveragePool, ConstantOfShape,
Dropout, ReduceSum, ReduceMean, Slice, Ceil, Split, Gather, Tile, NonZero, Cast, OneHot.
Their unit tests are added as well. -
14 new operators are added to sonnx module for both backend and frontend:
Gemm,
GlobalAveragePool,
ConstantOfShape,
Dropout,
ReduceSum,
ReduceMean,
Slice,
Ceil,
Split,
Gather,
Tile,
NonZero,
Cast,
OneHot.
Their tests are added as well. -
Some ONNX models are imported into SINGA, including
Bert-squad,
Arcface,
FER+ Emotion,
MobileNet,
ResNet18,
Tiny Yolov2,
Vgg16, and Mnist. -
Some operators now support multidirectional broadcasting,
including Add, Sub, Mul, Div, Pow, PRelu, Gemm -
[Distributed training with communication optimization]. DistOpt
has implemented multiple optimization techniques, including gradient sparsification,
chunk transmission, and gradient compression. -
Computational graph construction at the CPP level. The operations submitted to the Device are buffered.
After analyzing the dependency, the computational graph is created, which is further analyzed for
speed and memory optimization. To enable this feature, use the Module API. -
New website based on Docusaurus. The documentation files are moved to a separate repo [singa-doc]](https://github.com/apache/singa-doc).
The static website files are stored at singa-site. -
DNNL(Deep Neural Network Library), powered by Intel,
is integrated intomodel/operations/[batchnorm|pooling|convolution]
,
the changes is opaque to the end users. The current version is dnnl v1.1
which replaced previous integration of mkl-dnn v0.18. The framework could
boost the performance of dl operations when executing on CPU. The dnnl dependency
is installed through conda. -
Some Tensor APIs are marked as deprecated which could be replaced by broadcast,
and it can support better on multi-dimensional operations. These APIs are
add_column(), add_row(), div_column(), div_row(), mult_column(), mult_row() -
Conv and Pooling are enhanced to support fine-grained padding like (2,3,2,3),
and SAME_UPPER, SAME_LOWER
pad mode and shape checking. -
Reconstruct soonx,
- Support two types of weight value (Initializer and Constant Node);
- For some operators (BatchNorm, Reshape, Clip, Slice, Gather, Tile, OneHot),
move some inputs to its attributes; - Define and implement the type conversion map.
V2.0.0
In this release, we have added the support of ONNX, implemented the CPU operations using MKLDNN, added operations for autograd, and updated the dependent libraries and CI tools.
-
Core components
- [SINGA-434] Support tensor broadcasting
- [SINGA-370] Improvement to tensor reshape and various misc. changes related to SINGA-341 and 351
-
Model components
- [SINGA-333] Add support for Open Neural Network Exchange (ONNX) format
- [SINGA-385] Add new python module for optimizers
- [SINGA-394] Improve the CPP operations via Intel MKL DNN lib
- [SINGA-425] Add 3 operators , Abs(), Exp() and leakyrelu(), for Autograd
- [SINGA-410] Add two function, set_params() and get_params(), for Autograd Layer class
- [SINGA-383] Add Separable Convolution for autograd
- [SINGA-388] Develop some RNN layers by calling tiny operations like matmul, addbias.
- [SINGA-382] Implement concat operation for autograd
- [SINGA-378] Implement maxpooling operation and its related functions for autograd
- [SINGA-379] Implement batchnorm operation and its related functions for autograd
-
Utility functions and CI
- [SINGA-432] Update depdent lib versions in conda-build config
- [SINGA-429] Update docker images for latest cuda and cudnn
- [SINGA-428] Move Docker images under Apache user name
-
Documentation and usability
- [SINGA-395] Add documentation for autograd APIs
- [SINGA-344] Add a GAN example
- [SINGA-390] Update installation.md
- [SINGA-384] Implement ResNet using autograd API
- [SINGA-352] Complete SINGA documentation in Chinese version
-
Bugs fixed
- [SINGA-431] Unit Test failed - Tensor Transpose
- [SINGA-422] ModuleNotFoundError: No module named "_singa_wrap"
- [SINGA-418] Unsupportive type 'long' in python3.
- [SINGA-409] Basic
singa-cpu
import throws error - [SINGA-408] Unsupportive function definition in python3
- [SINGA-380] Fix bugs from Reshape