[Docs] Update deeprec2304 release images and notes in README.md & RELEASE.md. (DeepRec-AI#865)

liutongxuan · web-flow · commit f7ed7fc3cda8 · 2023-05-19T23:07:04.000+08:00
Signed-off-by: Tongxuan Liu &lt;tongxuan.ltx@alibaba-inc.com&gt;
diff --git a/README.md b/README.md
@@ -4,7 +4,7 @@
 --------------------------------------------------------------------------------
 
 ## **Introduction**
-DeepRec is a high-performance recommendation deep learning framework based on [TensorFlow 1.15](https://www.tensorflow.org/), [Intel-TensorFlow](https://github.com/Intel-tensorflow/tensorflow) and [NVIDIA-TensorFlow](https://github.com/NVIDIA/tensorflow).
+DeepRec is a high-performance recommendation deep learning framework based on [TensorFlow 1.15](https://www.tensorflow.org/), [Intel-TensorFlow](https://github.com/Intel-tensorflow/tensorflow) and [NVIDIA-TensorFlow](https://github.com/NVIDIA/tensorflow). It is hosted in incubation in LF AI & Data Foundation.
 
 
 ### **Background**
@@ -95,13 +95,13 @@ $ pip3 install /tmp/tensorflow_pkg/tensorflow-1.15.5+${version}-cp38-cp38m-linux
 #### Image for CPU
 
 ```
-alideeprec/deeprec-release:deeprec2302-cpu-py38-ubuntu20.04
+alideeprec/deeprec-release:deeprec2304-cpu-py38-ubuntu20.04
 ```
 
 #### Image for GPU CUDA11.6
 
 ```
-alideeprec/deeprec-release:deeprec2302-gpu-py38-cu116-ubuntu20.04
+alideeprec/deeprec-release:deeprec2304-gpu-py38-cu116-ubuntu20.04
 ```
 
 ***
diff --git a/RELEASE.md b/RELEASE.md
@@ -1,3 +1,107 @@
+# Release r1.15.5-deeprec2304
+
+## **Major Features and Improvements**
+
+### **Embedding**
+
+- Suport tf.int32 dtype using feature_column API `tf.feature_column.categorical_column_with_embedding`.
+- Make the rules of export frequencies and versions the same as the rule of export keys.
+- Optimize cuda kernel implementation in GroupEmbedding.
+- Support to read embedding files with mmap and madvise, and direct IO.
+- Add double check in find_wait_free of lockless dense hashmap.
+- Change Embedding init value of version in EV from 0 to -1.
+- Interface 'GetSnapshot()' backward compatibility.
+- Implement CPU GroupEmbedding lookup sparse Op.
+- Make GroupEmbedding compatible with sequence feature_column interface.
+- Fix sp_weights indices calculation error in GroupEmbedding.
+- Add group_strategy to control parallelism of group_embedding.
+
+### **Graph & Grappler Optimization**
+
+- Support SparseTensor as placeholder in Sample-awared Graph Compression.
+- Add Dice fusion grappler and ops.
+- Enable MKL Matmul + Bias + LeakyRelu fusion.
+
+### **Runtime Optimization**
+
+- Avoid unnecessary polling in EventMgr.
+- Reduce lock cost and memory usage in EventMgr when use multi-stream.
+
+### **Ops & Hardware Acceleration**
+
+- Register GPU implementation of int64 type for Prod.
+- Register GPU implementation of string type for Shape, ShapeN and ExpandDims.
+- Optimize list of GPU SegmentReductionOps.
+- Optimize zeros_like_impl by reducing calls to convert_to_tensor.
+- Implement GPU version of SparseSlice Op.
+- Delay Reshape when rank > 2 in keras.layers.Dense so that post op can be fused with MatMul.
+- Implement setting max_num_threads hint to oneDNN at compile time.
+- Implement TensorPackTransH2DOp to improve SmartStage performance on GPU.
+
+### **IO**
+
+- Add tensor shape meta-data support for ParquetDataset.
+- Add arrow BINARY type support for ParquetDataset.
+
+### **Serving**
+
+- Add Dice fusion to inference mode.
+- Enable INFERENCE_MODE in processor.
+- Support TensorRT 8.x in Inference.
+- Add configure filed to control enable TensorRT or not.
+- Add flag for device_placement_optimization.
+- Avoid to clustering feature column related nodes when enable TensorRT.
+- Optimize inference latency when load increment checkpoint.
+- Optimize performance via only place TensorRT ops to gpu device.
+
+### **Environment & Build**
+
+- Support CUDA 12.
+- Update DEFAULT_CUDA_VERSION and DEFAULT_CUDNN_VERSION in configure.py.
+- Move thirdparties from WORKSPACE to workspace.bzl.
+- Update urls corresponding to colm, ragel, aliyun-oss-sdk and uuid.
+
+### **BugFix**
+
+- Fix constant op placing bug for device placement optimization.
+- Fix Nan issue occurred in group_embedding API.
+- Fix SOK not compatible with variable issue.
+- Fix memory leak when update full model in serving.
+- Fix 'cols_to_output_tensors' not setted issue in GroupEmbedding.
+- Fix core dump issue about saving GPU EmbeddingVariable.
+- Fix cuda resource issue in KvResourceImportV3 kernel.
+- Fix loading signature_def with coo_sparse bug and add UT.
+- Fix the bug that the training ends early when the workqueue is enabled.
+- Fix the control edge connection issue in device placement optimization.
+
+### **ModelZoo**
+
+- Modify GroupEmbedding related function usage.
+- Update masknet example with layernorm.
+
+### **Tool & Documents**
+
+- Add tools for remove filtered features in checkpoint.
+- Add Arm Compute Library (ACL) user documents.
+- Update Embedding Variable document to fix initializer config example.
+- Update GroupEmbedding document.
+- Update processor documents.
+- Add user documents for intel AMX.
+- Add TensorRT usage documents.
+- Update documents for ParquetDataset.
+
+More details of features: [https://deeprec.readthedocs.io/zh/latest/](url)
+
+## **Release Images**
+
+### **CPU Image**
+
+`alideeprec/deeprec-release:deeprec2304-cpu-py38-ubuntu20.04`
+
+### **GPU Image**
+
+`alideeprec/deeprec-release:deeprec2304-gpu-py38-cu116-ubuntu20.04`
+
 # Release r1.15.5-deeprec2302
 
 ## **Major Features and Improvements**
diff --git a/docs/docs_en/DeepRec-Compile-And-Install.md b/docs/docs_en/DeepRec-Compile-And-Install.md
@@ -112,7 +112,7 @@ pip3 install /tmp/tensorflow_pkg/tensorflow-1.15.5+${version}-cp38-cp38m-linux_x
 
 x86_64:
 ```
-alideeprec/deeprec-release:deeprec2302-cpu-py38-ubuntu20.04
+alideeprec/deeprec-release:deeprec2304-cpu-py38-ubuntu20.04
 ```
 
 arm64:
@@ -123,5 +123,5 @@ alideeprec/deeprec-release:deeprec2302-cpu-py38-ubuntu22.04-arm64
 **GPU Image with CUDA 11.6**
 
 ```
-alideeprec/deeprec-release:deeprec2302-gpu-py38-cu116-ubuntu20.04
+alideeprec/deeprec-release:deeprec2304-gpu-py38-cu116-ubuntu20.04
 ```
diff --git a/docs/docs_en/Estimator-Compile-And-Install.md b/docs/docs_en/Estimator-Compile-And-Install.md
@@ -44,7 +44,7 @@ DeepRec provide new distributed protocols such as grpc++ and star_server, which
 
 Source Code: [https://github.com/DeepRec-AI/estimator](https://github.com/DeepRec-AI/estimator)
 
-Develop Branch：master, Latest Release Branch: deeprec2302
+Develop Branch：master, Latest Release Branch: deeprec2304
 
 ## Estimator Build
 
diff --git a/docs/docs_en/TFServing-Compile-And-Install.md b/docs/docs_en/TFServing-Compile-And-Install.md
@@ -43,7 +43,7 @@ We provide optimized TFServing which could highly improve performance in inferen
 
 Source Code: [https://github.com/DeepRec-AI/serving](https://github.com/DeepRec-AI/serving)
 
-Develop Branch: master, Latest Release Branch: deeprec2302
+Develop Branch: master, Latest Release Branch: deeprec2304
 
 ## TFServing Build
 
diff --git a/docs/docs_zh/DeepRec-Compile-And-Install.md b/docs/docs_zh/DeepRec-Compile-And-Install.md
@@ -111,7 +111,7 @@ pip3 install /tmp/tensorflow_pkg/tensorflow-1.15.5+${version}-cp38-cp38m-linux_x
 
 x86_64:
 ```
-alideeprec/deeprec-release:deeprec2302-cpu-py38-ubuntu20.04
+alideeprec/deeprec-release:deeprec2304-cpu-py38-ubuntu20.04
 ```
 
 arm64:
@@ -122,7 +122,7 @@ alideeprec/deeprec-release:deeprec2302-cpu-py38-ubuntu22.04-arm64
 **GPU CUDA11.6镜像**
 
 ```
-alideeprec/deeprec-release:deeprec2302-gpu-py38-cu116-ubuntu20.04
+alideeprec/deeprec-release:deeprec2304-gpu-py38-cu116-ubuntu20.04
 ```
 
 ## DeepRec Processor编译打包
diff --git a/docs/docs_zh/Estimator-Compile-And-Install.md b/docs/docs_zh/Estimator-Compile-And-Install.md
@@ -44,7 +44,7 @@
 
 代码库：[https://github.com/DeepRec-AI/estimator](https://github.com/DeepRec-AI/estimator)
 
-开发分支：master，最新Release分支：deeprec2302
+开发分支：master，最新Release分支：deeprec2304
 
 ## Estimator编译
 
diff --git a/docs/docs_zh/TFServing-Compile-And-Install.md b/docs/docs_zh/TFServing-Compile-And-Install.md
@@ -43,7 +43,7 @@
 
 代码库：[https://github.com/DeepRec-AI/serving](https://github.com/DeepRec-AI/serving)
 
-开发分支：master，最新Release分支：deeprec2302
+开发分支：master，最新Release分支：deeprec2304
 
 ## TFServing编译&打包