Releases: PaddlePaddle/PaddleFlow
Releases · PaddlePaddle/PaddleFlow
v0.14.7-rc1
What's Changed
- [pipeline] add metrics for pipeline, run and schedule by @HaozhengAN in #1015
- simplify afs mount ins by @caizejun in #1039
- feature: hdfs chown support by @luoyuedong in #1038
- [pipeline] cherry-pick from 145: fatal error: concurrent map writes (#1026) by @HaozhengAN in #1035
- pread use too many mem by @caizejun in #1042
- feat(job): generate log url for paddleflow job by @D0m021ng in #1033
- feature: all filesystem support chown and chmod by @luoyuedong in #1046
- feature: all filesysetem support chown and chmod by @luoyuedong in #1048
- perf: optimize s3 stat and write by @luoyuedong in #1049
- feat(job): adjust log url format by @D0m021ng in #1050
- bos can not get sts with properity sts not equal true by @caizejun in #1052
- fix: The name of pv is inconsistent when creating jobs concurrently by @luoyuedong in #1058
- fs client remove all have not check if a entry is dir by @caizejun in #1059
- Revert "fs client remove all have not check if a entry is dir" by @hysunflower in #1061
- fix: sts delete bug by @luoyuedong in #1060
- [pipeline]: bugfix: no resume for run by @HaozhengAN in #1055
- paddleflow-deployment.yaml 1.4.5.2,Add operation - n paddleflow by @liujiaxing123666 in #1067
- fixbug: support init k3s, and fix create job create ns failed by @qiaoshuangshuang in #1070
- fs client remove all have not check if a entry is dir by @caizejun in #1064
- fix: sts bug by @luoyuedong in #1073
- fix: minio list bug by @luoyuedong in #1074
- fix: sts duration cause panic bug by @luoyuedong in #1077
- close dir do not need to reduce filehandler count by @caizejun in #1079
- fix: mount pod cpu and memory limit bug by @luoyuedong in #1082
- CFS Mkdir perm by @luoyuedong in #1085
- s3 api not allow to delete over 1k obj at once, rename op may trigger… by @caizejun in #1081
- feature: csi mount default with 0777 by @luoyuedong in #1091
- 【deploy】Add files via upload by @liujiaxing123666 in #1089
- fix start bos by sts by @caizejun in #1093
- docs: add fs create docs by @luoyuedong in #1099
- add demo by @hysunflower in #1102
- more write error log by @caizejun in #1101
- 【deploy】add yaml deploy files by @liujiaxing123666 in #1106
- feature: update go.mod by @luoyuedong in #1107
- 【dockerfile】add glusterfs dockerfile by @liujiaxing123666 in #1109
- 【dockerfile】update filename by @hysunflower in #1110
- 【deploy】paddleflow dockerfile Integrated glusterfs and pfs-csi-plugin resources by @liujiaxing123666 in #1112
- 【dockerfile】update Dockerfile.gluster user by @hysunflower in #1113
- 【deploy】volcano-deploy.yaml root-equota subsequence by @liujiaxing123666 in #1115
- paddleflow-deployment.yaml split crd by @liujiaxing123666 in #1116
- utime bug. write wrong time when utime files by @caizejun in #1117
- fix: readdir may be read cache without not expire and update go mode … by @luoyuedong in #1118
- readdir dels unexpect entries by @caizejun in #1119
- paddleflow-server Dockerfile base paddleflow/ubuntu:latest by @liujiaxing123666 in #1127
- fix: sts subpath=/ err by @luoyuedong in #1125
- feat(queue): get the namespace of queue from annotations when sync volcano queue by @D0m021ng in #1135
- Python client可用获取版本信息、客户端信息 by @tornado404 in #1095
- feat(runtime): optimize runtime interface by @D0m021ng in #1137
- feat(job): update job storage interface by @D0m021ng in #1138
- fix bug for unmarshall(default float -> json.Number) by @Mo-Xianyuan in #1139
- feat(job): extract job type and framework when job event detected by @D0m021ng in #1144
- feat(job): update kubernetes runtime job interface getter by @D0m021ng in #1145
- feat(job plugin): update job plugin register by @D0m021ng in #1146
- feat(runtime): change the parameter of runtime client interface by @D0m021ng in #1147
- feat(queue plugin): update queue plugin register by @D0m021ng in #1148
- feat(job): update job event handler and job struct by @D0m021ng in #1149
- feat(job): add kubeflow PaddleJob by @D0m021ng in #1150
- feat(job): remove useless runtime client interface by @D0m021ng in #1151
- feat(log): update log url for job by @D0m021ng in #1154
Full Changelog: v0.14.6...v0.14.7-rc1
v0.14.6 🌈
Exciting Changes
- s3 api not allow to delete over 1k obj at once, rename op may trigger… @caizejun (#1081)
- CFS Mkdir perm @luoyuedong (#1085)
- fix: mount pod cpu and memory limit bug @luoyuedong (#1082)
- close dir do not need to reduce filehandler count @caizejun (#1079)
- fix: sts duration cause panic bug @luoyuedong (#1077)
- fix: minio list bug @luoyuedong (#1074)
- fix: sts bug @luoyuedong (#1073)
- fs client remove all have not check if a entry is dir @caizejun (#1064)
- fixbug: support init k3s, and fix create job create ns failed @qiaoshuangshuang (#1070)
- paddleflow-deployment.yaml 1.4.5.2,Add operation - n paddleflow @liujiaxing123666 (#1067)
- [pipeline]: bugfix: no resume for run @HaozhengAN (#1055)
- fix: sts delete bug @luoyuedong (#1060)
- Revert "fs client remove all have not check if a entry is dir" @hysunflower (#1061)
- fs client remove all have not check if a entry is dir @caizejun (#1059)
- fix: The name of pv is inconsistent when creating jobs concurrently @luoyuedong (#1058)
- bos can not get sts with properity sts not equal true @caizejun (#1052)
- feat(job): adjust log url format @D0m021ng (#1050)
- perf: optimize s3 stat and write @luoyuedong (#1049)
- feature: all filesysetem support chown and chmod @luoyuedong (#1048)
- feature: all filesystem support chown and chmod @luoyuedong (#1046)
- feat(job): generate log url for paddleflow job @D0m021ng (#1033)
- pread use too many mem @caizejun (#1042)
- [pipeline] cherry-pick from 145: fatal error: concurrent map writes (#1026) @HaozhengAN (#1035)
- feature: hdfs chown support @luoyuedong (#1038)
- simplify afs mount ins @caizejun (#1039)
- [pipeline] add metrics for pipeline, run and schedule @HaozhengAN (#1015)
- fix: csi secret env @luoyuedong (#1034)
- fix: csi mount pod need add secret key @luoyuedong (#1032)
- feature: add secret key in paddleflow and csi support csi @luoyuedong (#1031)
- feature: add bos sts type @luoyuedong (#1016)
- fix warmup bug @caizejun (#1030)
- feat(cluster resources): support list cluster resources by queue @D0m021ng (#1011)
- Test op @liujiaxing123666 (#1029)
- fix: cache path mont propagation @luoyuedong (#1022)
- feature: add fs cache go-sdk @luoyuedong (#1014)
- feature: add 1.13 csi-plugin install docs @luoyuedong (#1013)
- feature: add sts paddleflow server @luoyuedong (#1008)
- [pipeline] opt return code and return value of some api according to testi result @HaozhengAN (#1010)
- feature: add k3s runtime, and only support single job @qiaoshuangshuang (#987)
- add ns check and create if not exist @tornado404 (#1005)
- feat(job): support aware of request resources is changed while pod is running @D0m021ng (#1006)
- feature: add afs mount @luoyuedong (#1004)
- feat(monitor): add grafana dashboard for volcano @D0m021ng (#1003)
- Revert "Revert "reduce dockerfile and support muti-arch binary and image"" @tornado404 (#1001)
- fix: hdfs abcExecption bug fix @luoyuedong (#998)
- Revert "reduce dockerfile and support muti-arch binary and image" @tornado404 (#1000)
- reduce dockerfile and support muti-arch binary and image @tornado404 (#986)
- [pieline] bugfix: fix bug that root user cannot user the fs which belong to other user @HaozhengAN (#996)
- fix: cfs mount bug @luoyuedong (#995)
- [pipeline] optimize the msg of Run @HaozhengAN (#993)
- feature(job): add patch for custom single job @D0m021ng (#992)
- fix: writer close bug @luoyuedong (#989)
- [pipeline]: Optimize interface return value @HaozhengAN (#975)
- feature: cfs mount @luoyuedong (#988)
- fix write not flush, reader read emtpy and meta size not zero bug @luoyuedong (#985)
- reduce relations between job and task Listener @tornado404 (#983)
- security fix, fix python cli pyyaml from 5.3 to 5.4 @qiaoshuangshuang (#984)
- fix reader read err maybe cause dead read loop @luoyuedong (#980)
- improve job handler by dynamic watch @tornado404 (#981)
- delete accesskey @caizejun (#978)
- fix stop job failed @tornado404 (#976)
- fix append error @luoyuedong (#973)
- fix autopatch resource @tornado404 (#970)
- IsSkipResourceValidate @tornado404 (#971)
- Fix vc queue weight is empty @D0m021ng (#968)
- [pipeline] fix bug: there is no endTime where job is terminated @HaozhengAN (#966)
- feat(job): refactor runtime job struct @D0m021ng (#965)
- support env limit @tornado404 (#959)
- feat(ut): remove useless spark apis code @D0m021ng (#963)
- feat(job): adjust job event handler code @D0m021ng (#962)
- delete file @luoyuedong (#961)
- fix: hdfs open and create maybe conflict @luoyuedong (#958)
- feature: add mount warmup tool and increase gc interval @luoyuedong (#954)
- support using default queue @tornado404 (#956)
- bug fix: can't read too many direntrys @caizejun (#953)
- Fix: return alloctable gpu resoruces with uniform format @D0m021ng (#952)
- fix: hdfs append bug @luoyuedong (#948)
- fix log range @tornado404 (#949)
- add log client @tornado404 (#947)
- add paddle job resource support when using extensiontemplate @tornado404 (#944)
What's Changed
- add paddle job resource support when using extensiontemplate by @tornado404 in #944
- add log client by @tornado404 in #947
- fix log range by @tornado404 in #949
- fix: hdfs append bug by @luoyuedong in #948
- Fix: return alloctable gpu resoruces with uniform format by @D0m021ng in #952
- bug fix: can't read too many direntrys by @caizejun in #953
- support using default queue by @tornado404 in #956
- feature: add mount warmup tool and increase gc interval by @luoyuedong in #954
- fix: hdfs open and create maybe conflict by @luoyuedong in #958
- delete file by @luoyuedong in #961
- feat(job): adjust job event handler code by @D0m021ng in #962
- feat(ut): remove useless spark apis code by @D0m021ng in #963
- support env limit by @tornado404 in #959
- feat(job): refactor runtime job struct by @D0m021ng in #965
- [pipeline] fix bug: there is no endTime where job is terminated by @HaozhengAN in #966
- Fix vc queue weight is empty by @D0m021ng in #968
- IsSkipResourceValidate by @tornado404 in #971
- fix autopatch resource by @tornado404 in #970
- fix append error by @luoyuedong in #973
- fix stop job failed by @tornado404 in #976
- delete accesskey by @caizejun in #978
- improve job handler by dynamic watch by @tornado404 in #981
- fix reader read err maybe cause dead read loop by @luoyuedong in #980
- security fix, fix python cli pyyaml from 5.3 to 5.4 by @qiaoshuangshuang in #984
- reduce relations between job and task Listener by @tornado404 in #983
- fix write not flush, reader read emtpy and meta size not zero bug by @luoyuedong in #985
- feature: cfs mount by @luoyuedong in #988
- [pipeline]: Optimize interface return value by @HaozhengAN in #975
- fix: writer close bug by @luoyuedong in #989
- feature(job): add patch for custom single job by @D0m021ng in #992
- [pipeline] optimize the msg of Run by @HaozhengAN in #993
- fix: cfs mount bug by @luoyuedong in #995
- [pieline] bugfix: fix bug that root user cannot user the fs which belong to other user by @HaozhengAN in #996
- reduce dockerfile and support muti-arch binary and image by @tornado404 in #986
- Revert "reduce dockerfile and support muti-arch binary and image" by @tornado404 in #1000
- fix: hdfs abcExecption bug fix by @luoyuedong in #998
- Revert "Revert "reduce dockerfile and support muti-arch binary and image"" by @tornado404 in #1001
- feat(monitor): add grafana dashboard for volcano by @D0m021ng in #1003
- feature: add afs mount by @luoyuedong in #1004
- feat(job): support aware of request resources is changed while pod is running by @D0m021ng in #1006
- add ns check and create if not exist by @tornado404 in #1005
- feature: add k3s runtime, and only support single job by @qiaoshuangshuang in #987
- [pipeline] opt return code and return value of some api according to testi result by @HaozhengAN in #1010
- feature: add sts ...
fix fatal bug in workflow
What's Changed
- [pipeline] bug fix: fatal error: concurrent map writes by @HaozhengAN in #1026
Full Changelog: v0.14.5.3...v0.14.5.4
improve Robustness
What's Changed
- fix: develop bug by @luoyuedong in #990
- fix: cache path mont propagation by @luoyuedong in #1021
Full Changelog: v0.14.5.2...v0.14.5.3
v0.14.5.2
What's Changed
- support using default queue (#956) by @tornado404 in #957
- bug fix: can't read too many direntrys by @caizejun in #955
- fix: hdfs readdir bug by @luoyuedong in #960
- support env limit cherry pick by @tornado404 in #964
- [pipeline] fix bug: there is no endTime where job is terminated (#966) by @HaozhengAN in #967
- fix autopatch resource (#970) by @tornado404 in #972
- fix mount pod default mem limit and cpu limit by @luoyuedong in #977
- delete accesskey by @caizejun in #979
- fix append bug by @luoyuedong in #974
Full Changelog: v0.14.5.1...v0.14.5.2
v0.1.4.5.1
v0.1.4.5.1
v0.14.6-beta0
What's Changed
- fix branch name and tag name is wrong by @tornado404 in #888
- [pipeline]: change the regex patter of pipeline name and run name by @HaozhengAN in #886
- fix: hdfs read bug by @luoyuedong in #891
- feat(runtime): remove runtime v1 from job manager by @D0m021ng in #890
- feat(runtime): remove runtime v1 by @D0m021ng in #894
- update docs by @tornado404 in #895
- 【pipeline】 add FAQ by @HaozhengAN in #898
- feat(cluster): add cache for cluster resources by @D0m021ng in #899
- feature: uninstall paddlelfow storage by @caizejun in #841
- feat(cluster): update cluster resources cache by @D0m021ng in #901
- feat(cluster): init cluster resource cache by @D0m021ng in #902
- feat(cluster resources): add listener for kubernetes node and pod by @D0m021ng in #905
- feat(runtime): adjust runtime v2 unit test by @D0m021ng in #906
- feat: add file_client s3 unit test by @caizejun in #904
- update gorm version by @tornado404 in #908
- [pipeline] update createPipeline and UpdatePipeline API: add YamlRaw parameter by @HaozhengAN in #900
- feat(cluster): add node resource sync controller by @D0m021ng in #907
- fix: close filehandles panic by @luoyuedong in #911
- add resource list api by @tornado404 in #909
- feat(cluster resources): adjust pod update by @D0m021ng in #912
- feat(cluster resources): update cluster resources cache by @D0m021ng in #914
- bugfix: s3 rename failed by @caizejun in #913
- feat(cluster resources): delete resources when task is completed by @D0m021ng in #915
- feat(cluster resources): support store shared gpu device index by @D0m021ng in #918
- feat(common): adjust memory resource name by @D0m021ng in #919
- fix: delete cache file may be thread exhaustion by @luoyuedong in #920
- feat(cluster resources): adjust cluster resources api by @D0m021ng in #921
- [pipeline] createRun support failureOptions by @HaozhengAN in #924
- [pipeline]: Fix bugs that TestRestartEntry fail occasionally by @HaozhengAN in #926
- support image and resource patch when customJobYaml by @tornado404 in #922
- Fix: update allocated task resources failed by @D0m021ng in #932
- feat(runtime): add feature gate for JobSync and QueueSync by @D0m021ng in #933
- add mpiJob support by @tornado404 in #925
- feat(job): add more unit test for job by @D0m021ng in #936
- add pods logs support in kube runtime by @tornado404 in #934
- fix mpi jobmode to master worker by @tornado404 in #937
- add logs api for pods by @tornado404 in #940
- Fix: update node labels failed by @D0m021ng in #942
- feat(job): remove useless code by @D0m021ng in #943
Full Changelog: v0.14.5...v0.14.6-beta0
v0.14.5
What's Changed
- Fs by @ElsieFan in #639
- csi: request = 0 by @ElsieFan in #640
- Affinity2 by @ElsieFan in #645
- there is no class name Match in python3.6's re module by @HaozhengAN in #643
- feat: namespace code refactoring by @luoyuedong in #646
- feat(job runtime): add framework interface and client for runtime v2 by @D0m021ng in #641
- feat(job api): adjust PFJob structure by @D0m021ng in #647
- add ray job crd by @tornado404 in #653
- feat: add readdir lookup getattr access by @luoyuedong in #656
- feat: add mkdir read write release by @luoyuedong in #659
- pfs-fuse: ufs decouple with nodefs.File(third-party definition) by @ElsieFan in #660
- update gopkg/yaml v2 to v3 by @StevenYangCao in #661
- feat: add ufs_new by @luoyuedong in #662
- feat(job): add single job implement by @D0m021ng in #650
- (feat: job config) add all in one job template yaml file by @D0m021ng in #663
- Fd by @ElsieFan in #664
- pfs-fuse: bugfix buffer pool by @ElsieFan in #665
- Fix: init default-queue failed when queue is already exist by @D0m021ng in #666
- Fix: builtin single job cannot read job template by @D0m021ng in #667
- Rayjob apiserver by @tornado404 in #654
- fix command nil (#648) by @tornado404 in #670
- feat: namespace code refactoring by @caizejun in #672
- doc: fs cache node affinity by @ElsieFan in #673
- 【pipeline-dsl】: 修复post_process 没有同步pipeline 的环境变量的bug by @HaozhengAN in #668
- 【pipeline】update dsl docs and example by @HaozhengAN in #657
- feat: add rmdir setattr by @luoyuedong in #676
- S3 by @ElsieFan in #677
- fix workdir absent by @tornado404 in #678
- feat(job sync): add runtime v2 job sync by @D0m021ng in #671
- Rayjob member by @tornado404 in #674
- location-awareness: fix affinity nil ptr by @ElsieFan in #683
- update extension_template from text to mediumtext, length from 65535 … by @tornado404 in #684
- feat(runtime): add kubernetes runtime by @D0m021ng in #682
- feat(runtime): add kubernetes job log by @D0m021ng in #688
- feat(runtime): update single job by @D0m021ng in #691
- feat: add file test and rename by @luoyuedong in #692
- feat(runtime): runtime v2 support PaddleJob and PyTorchJob by @D0m021ng in #694
- feat(runtime): adjust FrameworkVersion by @D0m021ng in #695
- feat(runtime): add runtime sample by @D0m021ng in #696
- feat(runtime queue): runtime v2 support queue manager by @D0m021ng in #701
- feat(runtime job): change JobBuilder to JobPlugin by @D0m021ng in #707
- bugfix: runtime_info used before initializing by @StevenYangCao in #710
- bugfix: imp entry expire and fs_name limit by @caizejun in #711
- fix: setattr bug by @luoyuedong in #706
- fix bug by @8olio8 in #713
- feat(runtime): runtime v2 support SparkApplication by @D0m021ng in #714
- feat: update the length of fs_id filed to 200 by @caizejun in #700
- fix: corruent file system bug by @luoyuedong in #716
- bugfix: DeleteRun used 3 transaction by @StevenYangCao in #719
- feat(runtime): adjust plugin manager by @D0m021ng in #721
- feat(runtime): adjust event listener register by @D0m021ng in #723
- feat(runtime): runtime v2 support queue sync by @D0m021ng in #724
- feature: add info log by @luoyuedong in #722
- feat(runtime): add switch to runtime v2 by @D0m021ng in #728
- feat: add gpu exporter by @kiritoxkiriko in #658
- feat(runtime): runtime v2 support TFJob by @D0m021ng in #729
- location-awareness: pod patch cache stats by @ElsieFan in #731
- feat(runtime): runtime v2 support ArgoWorkflowJob by @D0m021ng in #732
- bugfix: get fsName and username by fsId by @caizejun in #727
- [pipeline-Bugfix]: "fix bug for calculate latest time by @HaozhengAN in #734
- feat(runtime): adjust buildSchedulingPolicy by @D0m021ng in #737
- fix: s3 write retry bug by @luoyuedong in #742
- update regular expression;add llustration of s3 storage when cache is enable by @HaozhengAN in #744
- location-awareness: server sync cache stats from mount pods to db by @ElsieFan in #733
- version by @ElsieFan in #745
- feat(build): add GitBranch when build paddleflow server by @D0m021ng in #752
- mount pod cleaning vs cache update by @ElsieFan in #753
- csi version equals to branch name by @ElsieFan in #754
- csi: mount pod mount annotation by @ElsieFan in #760
- feat(version): add paddleflow server version api by @D0m021ng in #759
- job: nodeAffinity merge by @ElsieFan in #758
- feat(version): python client support print paddleflow server version by @D0m021ng in #764
- fix: fs S3 bug and data cache read bug by @luoyuedong in #767
- bugfix: patch mount pod by @ElsieFan in #771
- imp: fuse sub path by @caizejun in #770
- feat: upgrade go version by @luoyuedong in #777
- feat: upgrade pfs-fuse by @luoyuedong in #778
- feat: add clean storage shellscript by @caizejun in #781
- csi: fscache update by @ElsieFan in #779
- csi: fscache update by @ElsieFan in #782
- bugfix: add pathCache default expire time by @caizejun in #783
- feat: upgrade pfs-fuse by @luoyuedong in #784
- feat(job): update job template and remove useless code by @D0m021ng in #785
- feat(job): runtime v1 use all-in-one job template by @D0m021ng in #788
- feat(job): create job with local fs by @D0m021ng in #791
- feat: add local type for hostpath by @luoyuedong in #792
- rayJob commits which cherry pick from 1.4.4.1 by @tornado404 in #793
- csi: mount pod containers by @ElsieFan in #794
- feat(job): runtime v2 kubernetes job support hostPath by @D0m021ng in #796
- csi: mount pod cache-worker by @ElsieFan in #795
- feat(monitor): add node gpu monitor by @D0m021ng in #801
- delete fs check mount by @ElsieFan in #804
- docs: fs node affinity by @ElsieFan in ...
v0.14.4
What's Changed
- convert flavour Gi to G by @tornado404 in #539
- fix: cmd error by @luoyuedong in #542
- Dev by @StevenYangCao in #544
- csi mount Info by @ElsieFan in #546
- fix: concurrent map read and map write when covert job status by @D0m021ng in #550
- Restore by @ElsieFan in #548
- Fix bug that server panic when sync single job by @D0m021ng in #555
- server config host & port by @ElsieFan in #552
- storage refactor - transaction by @ElsieFan in #557
- Go SDK 1.4.3 by @StevenYangCao in #549
- run.Name changes to ctx.Name from UserName setted in request by @StevenYangCao in #556
- �增加armdockerfile by @liujiaxing123666 in #559
- update pipeline example by @HaozhengAN in #553
- add 5 level PriorityClass by @tornado404 in #561
- Refactor storage - job by @ElsieFan in #558
- fix: mountpoint recover bug by @luoyuedong in #565
- csi restore mountpoint - mount by process by @ElsieFan in #569
- csi restore mountpoint - pod mount by @ElsieFan in #570
- csi restore mountpoint - pod mount by @ElsieFan in #571
- doc - csi mountpoint recovery by @ElsieFan in #573
- feat: job sync support kubeflow PyTorchJob, TFJob, and MPIJob by @D0m021ng in #572
- disable go proxy by @D0m021ng in #574
- ListRun add statusFilter && ListRun CLI/SDK consistent to API by @StevenYangCao in #566
- feat: support mxnet framework by @D0m021ng in #577
- create resources request by @liujiaxing123666 in #562
- job: pod mount propagation by @ElsieFan in #578
- feat: add python client for cce by @D0m021ng in #580
- feat: support submitting pytorch job by @D0m021ng in #582
- feat: add job metric service by @kiritoxkiriko in #576
- remove useless code by @D0m021ng in #583
- Resource by @liujiaxing123666 in #581
- Python sdk, including Run, Cache, Artifact, Pipeline by @StevenYangCao in #579
- fix: kernerl cache expire fix by @luoyuedong in #587
- add uninstaller by @Mo-Xianyuan in #586
- [pipeline] update docs of pipeline by @HaozhengAN in #575
- python sdk: run, pipeline, schedule by @StevenYangCao in #585
- fix pipeline fs validate bug by @StevenYangCao in #590
- Feat/add tf job by @kiritoxkiriko in #589
- Bugfix: fix bug for pipeline cache: when enable is False, would not record in db by @HaozhengAN in #592
- pipeline showversion show yaml detail by @StevenYangCao in #593
- fix: mount err by @luoyuedong in #596
- fix(executor): fix builtin distributed job submission bug by @kiritoxkiriko in #595
- move StopJobByID to KubeJob by @D0m021ng in #597
- fix unique index by @D0m021ng in #598
- remove useless code by @D0m021ng in #603
- add RBAC for training job by @tornado404 in #602
- Go sdk:update run, schedule by @StevenYangCao in #600
- storage refactor: image by @ElsieFan in #607
- storage refactor: ppl & ppl_version by @ElsieFan in #608
- change get queue sql by @D0m021ng in #610
- fix listAritfact URL error by @StevenYangCao in #609
- python sdk/client CreatePipeline can used without yaml path by @StevenYangCao in #604
- docs: update docs for queue and cluster by @D0m021ng in #612
- storage refactor: cluster & flavour by @ElsieFan in #611
- storage refactor: queque by @ElsieFan in #613
- storage refactor: run_artifact by @ElsieFan in #614
- csi recovery: source paht by @ElsieFan in #615
- [pipeline-dsl]: support complex structure and multiple fs by @HaozhengAN in #616
- Dsl by @HaozhengAN in #618
- [pipeline-dsl] update error message for init client by @HaozhengAN in #619
- Pipeline SDK/CLI reference doc upgrade to 1.4.3 by @StevenYangCao in #588
- bugFix: Pipeline cannot use list type param by @StevenYangCao in #617
- csi: pod mount add resource limit & request; clean-cache annotation by @ElsieFan in #622
- server: validate fs cache config resource format by @ElsieFan in #623
- add job and queue checks by @tornado404 in #621
- fix(conf): add serviceaccount missed log permission by @kiritoxkiriko in #584
- doc: update-missed-yaml by @kiritoxkiriko in #504
- [pipeline]: bugfix 1. artifact path in pod is not right when main_fs is not abs path, 2. align regular expressions by @HaozhengAN in #625
- imp transaction by @caizejun in #627
- csi mount pod resource by @ElsieFan in #632
- fix flavour update/list by removing clustername in pyclient by @tornado404 in #628
- clean path for workdir by @tornado404 in #631
- feat(metric): add queue metric by @kiritoxkiriko in #629
- feat(installer): add monitor json by @kiritoxkiriko in #630
- Revert by @ElsieFan in #637
- fix sdk problem in cluster_version/flavour_update/job_create by @tornado404 in #635
- Go SDK Get Pipeline add params by @StevenYangCao in #633
- fix command nil by @tornado404 in #648
- bugfix: update job status failed when create after delete job by @D0m021ng in #652
- cherry-pick from Fix: builtin single job cannot read job template (#667) by @tornado404 in #675
- Cherry-Pick(#666) Fix: init default-queue failed when queue is already exist by @D0m021ng in #679
- cherry pick from fix workdir absent (#678) by @tornado404 in #681
- update extension_template from text to mediumtext by @tornado404 in #686
New Contributors
- @liujiaxing123666 made their first contribution in #559
Full Changelog: v0.14.3...v0.14.4
v0.14.3
Features
Storage Module
Scheduling Module
- Unify all type Jobs submission methods
- Submit jobs to the cluster 10x faster, the maximum delay is 5 seconds
- Support trace log
Pipeline Module
Bug Fix
- fix flavour update failed by python client
- fix failed to update labels of queue