
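Shenglin Zhang (张圣林)
Title: Associate Professor
Email: zhangsl@nankai.edu.cn
Major: Software Engineering
Supervisor: Doctoral and master's student supervisor
Research interests: AIOps (intelligent IT operations), service management, machine learning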
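Biography
Shenglin Zhang is an associate professor and associate dean of the College of Software at Nankai University, a doctoral and master's student supervisor, and a jointly appointed researcher at the Haihe Laboratory of Advanced Computing and Critical Software (Xinchuang). His main research area is machine-learning-based AIOps, including anomaly detection, failure localization, root cause analysis, and failure prediction. He has published more than 40 papers as first or corresponding author in CCF A/B conferences and journals, and his work has been cited more than 2,300 times on Google Scholar. He has filed 24 Chinese invention patent applications, 6 of which have been granted. He has led 2 National Natural Science Foundation of China (NSFC) projects, 1 China Postdoctoral Science Foundation project, and more than 20 industry-funded projects (in cooperation with Huawei, ByteDance, MYbank, ZTE, and others). His honors include the First Prize of the Science and Technology Progress Award of the Chinese Institute of Electronics (third contributor), ISSRE (CCF B) best paper awards in 2024, 2023, and 2018, the Tsinghua University Outstanding Doctoral Dissertation award, the title of Nankai University's 9th "Good Teacher and Helpful Friend", the "Best Technical Cooperation Professor" award from Huawei's Computing Product Line, KylinSoft's "Outstanding Contribution to University-Enterprise Cooperation" award, and the First Prize of the Tianjin Science and Technology Progress Award. He was selected for Nankai University's "Baiqing Program" (百青计划) and for the Tianjin "131" Innovative Talent Training Project (third tier). Graduate students under his supervision have received 1 Nankai University Outstanding Master's Thesis award and 1 Nankai University Outstanding Graduate award.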
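Education
He received his Ph.D. in Engineering (Computer Science and Technology) from Tsinghua University in 2017 and his B.Eng. (Network Engineering) from Xidian University in 2012. During his doctoral studies, he studied at the Georgia Institute of Technology. He interned in Baidu's Operations Department from 2014 to 2017 and was a visiting scholar at Alibaba from 2018 to 2019.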
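Work Experience
He has served 16 times as a program committee member of CCF A/B international conferences. He is a CCF Senior Member, Vice Chair of the Academic Committee of the CCF Young Computer Scientists & Engineers Forum (YOCSEF) Tianjin (2023-2024), a standing committee member of the CCF Technical Committee on Internet, and an executive committee member of the CCF Technical Committees on Software Engineering and on Services Computing. As executive chair, he has organized technical forums at CNCC, the CCF China Network Conference, and the CCF China Digital Services Conference on multiple occasions, for which he received the "Best Forum Organization Award" of the CCF 2023 China Digital Services Conference. He received the IEEE Outstanding Leadership Award. He was chief editor of the CCF Digital Library focus article "AIOps Technologies" (《智能运维技术》) and contributed to the CCF Computer Science Frontier Series book "Ten Lectures on Internet Technology" (《互联网技术十讲》) and to the "China Computer Science and Technology Development Report (2019-2020)". Together with Prof. Dan Pei of Tsinghua University, he co-founded the CCF International AIOps Challenge and chaired its technical committee for the first and sixth editions. The challenge has been held successfully for seven consecutive editions and has become one of the three major annual events of the CCF Technical Committee on Internet, with a cumulative 1,325 participating teams and more than 100,000 online or in-person participations, promoting talent development and technical progress in AIOps in China. He and Prof. Dan Pei also co-founded the CCF OpenAIOps community, which aims to achieve collaborative innovation through open community cooperation and collective intelligence.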
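Research Projects, Achievements, Awards, and Patents
Research projects:
1. Research on Failure Diagnosis Mechanisms for Large-Scale Cloud Platforms Based on Multimodal Data, NSFC General Program, 2023.1-2026.12
2. Research on Anomaly Detection Mechanisms for Datacenter Network Devices Based on Multi-Syntax Semantic Logs, NSFC Young Scientists Fund, 2020.1-2022.12
3. Research on Log-Based Anomaly Detection Mechanisms for Datacenter Network Devices, China Postdoctoral Science Foundation General Program, 2019.6-2021.5
4. Predictive Intelligent Operations Technology for Optical Links in AI Clusters, Huawei collaboration project
5. Fault Agent Knowledge Understanding and Application Technology, Huawei collaboration project
6. Research and Practice of Intelligent Anomaly Diagnosis and Root Cause Analysis for Single-Machine Stability of Serverless Infrastructure, Alibaba collaboration project
7. AIOps Algorithm Research and Scenario Applications, ByteDance collaboration project
8. Sub-Health Detection and Localization Technology for Large-Scale Distributed Systems, Huawei Cloud collaboration project
9. Failure Identification and Root Cause Localization with Multimodal Data, Huawei collaboration project
10. Intelligent Diagnosis of Failures in Cloud-Native Systems, ZTE collaboration project
11. Knowledge-Graph-Based Root Cause Localization Mechanism for Polymorphic Failure Logs, CCF-Huawei Populus Grove Fund (Software Engineering track)
12. Research on Network Failure Diagnosis and Self-Healing for Large-Scale Datacenters, CCF-Tencent Rhino-Bird Creative Fund
13. Cloud-Native Observability: Research on System Hidden-Risk Discovery and Fault Tree Construction, Huawei collaboration project
14. Research on Cluster Communication Failure Diagnosis, Huawei collaboration project
15. Research on Graph-Reasoning-Based Failure Localization for Distributed Systems, MYbank collaboration project
16. Intelligent Anomaly Detection for Datacenter Network Devices, ZTE collaboration project
17. OS Failure Diagnosis, Huawei collaboration project
18. Intelligent Change Assessment Technology, Huawei collaboration project
19. Unsupervised Machine Clustering and Multi-KPI Anomaly Detection Models for Whole-Machine Anomalies, ByteDance collaboration project
20. Research on Failure Prediction Mechanisms for Next-Generation Internet Switches, CERNET Next Generation Internet Technology Innovation Project
21. Research on Log-Based Failure Prediction Mechanisms for Datacenter Switches, Fundamental Research Funds for the Central Universities
22. AI Operations Joint Innovation Technology Project, Baidu collaboration project
Publications: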
2025
1. Lei Tao, Minghua Ma, Shenglin Zhang*, Junhua Kuang, Xiao-Wei Guo, Canqun Yang, Dan Pei. Real-Time Anomaly Detection for Large-Scale Network Devices. IEEE/ACM Transactions on Networking (ToN), 2025 (CCF A)
2. Shenglin Zhang, Sibo Xia, Wenzhao Fan, Binpeng Shi, Xiao Xiong, Zhenyu Zhong, Minghua Ma, Yongqian Sun*, Dan Pei. Failure Diagnosis in Microservice Systems: A Comprehensive Survey and Analysis. ACM Transactions on Software Engineering and Methodology (TOSEM), 2025 (CCF A)
3. Yongqian Sun, Jiaju Wang, Zhengdan Li, Xiaohui Nie, Minghua Ma, Shenglin Zhang*, Yuhe Ji, Lu Zhang, Wen Long, Yongnan Luo, Hengmao Chen, Dan Pei. AIOpsArena: Scenario-Oriented Evaluation and Leaderboard for AIOps Algorithms in Microservices. IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), 2025 (CCF B)
2024
1. Shenglin Zhang, Ting Xu, Jun Zhu, Yongqian Sun*, Pengxiang Jin, Binpeng Shi, Dan Pei. Privacy-preserving MTS anomaly detection for network devices through federated learning. Information Sciences (CCF B, JCR Q1 & CAS Tier 1-Top, IF: 8.1), 2024: 121590.
2. Yongqian Sun, Minghan Liang, Shenglin Zhang*, Zeyu Che, Zhiyao Luo, Dongwen Li, Yuzhi Zhang, Dan Pei, Lemeng Pan, and Liping Hou. Efficient Multivariate Time Series Anomaly Detection Through Transfer Learning for Large-Scale Software Systems. ACM Transactions on Software Engineering and Methodology (TOSEM), 2024 (CCF A).
3. Shenglin Zhang, Yongxin Zhao, Sibo Xia, Shirui Wei, Yongqian Sun*, Chenyu Zhao, Shiyu Ma, Junhua Kuang, Bolin Zhu, Lemeng Pan, Yicheng Guo, Dan Pei. No More Data Silos: Unified Microservice Failure Diagnosis with Temporal Knowledge Graph. IEEE Transactions on Services Computing (TSC), 2024, 17(6):4013-4026 (CCF A).
4. Yongqian Sun, Zihan Lin, Binpeng Shi, Shenglin Zhang*, Shiyu Ma, Pengxiang Jin, Zhenyu Zhong, Lemeng Pan, Yicheng Guo, Dan Pei. Interpretable Failure Localization for Microservice Systems Based on Graph Autoencoder. ACM Transactions on Software Engineering and Methodology (TOSEM), 2024, 34(2):1-28 (CCF A).
5. Yuan Yuan*, Tongqing Zhou*, Xiuhong Tan, Yongqian Sun, Yuqi Li, Zhixing Li, Zhiping Cai, and Tiejun Li. Exploring Hierarchical Patterns for Alert Aggregation in Supercomputers. 2024 International Symposium on Software Reliability Engineering (ISSRE), Tsukuba, Japan, October 28-31, 2024 (Best Paper Award, CCF B).
6. Yongqian Sun, Yang Guo, Minghan Liang, Xidao Wen, Junhua Kuang, Shenglin Zhang*, Hongbo Li, Kaixu Xia, and Dan Pei. Multivariate Time Series Anomaly Detection based on Pre-trained Models with Dual-Attention Mechanism. 2024 International Symposium on Software Reliability Engineering (ISSRE), Tsukuba, Japan, October 28-31, 2024 (CCF B).
7. Shenglin Zhang, Zeyu Che, Zhongjie Pan, Xiaohui Nie, Yongqian Sun*, Lemeng Pan, Dan Pei. LabelEase: A Semi-Automatic Tool for Efficient and Accurate Trace Labeling in Microservices. 2024 International Symposium on Software Reliability Engineering (ISSRE), Tsukuba, Japan, October 28-31, 2024 (CCF B).
8. Shenglin Zhang, Xiao Xiong, Mengyao Li, Yongqian Sun*, Yongxin Zhao, Xia Chen, Bowen Deng and Dan Pei. Auto-PIP: Real-time Identification of Critical Performance Inflection Points in Software Stress Testing. 2024 International Symposium on Software Reliability Engineering (ISSRE), Tsukuba, Japan, October 28-31, 2024 (Best Industry Paper Award, CCF B).
9. Shenglin Zhang, Pengtian Zhu, Minghua Ma, Jiagang Wang, Yongqian Sun*, Dongwen Li, Jingyu Wang, Qianying Guo, Xiaolei Hua, Lin Zhu, Dan Pei. Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A model Based on Large Language Models. 2024 International Symposium on Software Reliability Engineering (ISSRE), Tsukuba, Japan, October 28-31, 2024 (CCF B).
10. Yongqian Sun, Binpeng Shi, Mingyu Mao, Minghua Ma, Sibo Xia, Shenglin Zhang*, Dan Pei. ART: A Unified Unsupervised Framework for Incident Management in Microservice Systems. 2024 IEEE/ACM Automated Software Engineering Conference (ASE), Sacramento, California, United States, October 27 – November 1, 2024 (CCF A).
11. Lei Tao, Shenglin Zhang, Zedong Jia, Jinrui Sun, Minghua Ma, Zhengdan Li*, Yongqian Sun, Canqun Yang, Yuzhi Zhang, Dan Pei. Giving Every Modality a Voice in Microservice Failure Diagnosis via Multimodal Adaptive Optimization. 2024 IEEE/ACM Automated Software Engineering Conference (ASE), Sacramento, California, United States, October 27 – November 1, 2024 (CCF A).
12. Shenglin Zhang, Yuhe Ji, Jiaqi Luan, Xiaohui Nie, Zi'ang Chen, Minghua Ma, Yongqian Sun*, Dan Pei. End-to-End AutoML for Unsupervised Log Anomaly Detection. 2024 IEEE/ACM Automated Software Engineering Conference (ASE), Sacramento, California, United States, October 27 – November 1, 2024 (CCF A).
13. Zhe Xie, Shenglin Zhang, Yitong Geng, Yao Zhang, Minghua Ma, Xiaohui Nie, Zhenhe Yao, Longlong Xu, Yongqian Sun, Wentao Li, Dan Pei. Microservice Root Cause Analysis With Limited Observability Through Intervention Recognition in the Latent Space. 2024 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), Barcelona, Spain, August 2024 (CCF A).
14. Lei Tao, Xianglin Lu, Shenglin Zhang*, Jiaqi Luan, Yingke Li, Mingjie Li, Zeyan Li, Qingyang Yu, Hucheng Xie, Ruijie Xu, Chenyuan Hu, Canqun Yang, Dan Pei. Diagnosing Performance Issues for Large-Scale Microservice Systems with Heterogeneous Graph. IEEE Transactions on Services Computing, 2024, 17(5):2223-2235 (CCF A).
15. Shenglin Zhang, Jun Zhu, Bowen Hao, Yongqian Sun*, Xiaohui Nie, Jingwen Zhu, Xilin Liu, Xiaoqian Li, Yuchi Ma, Dan Pei. Fault Diagnosis for Test Alarms in Microservices Through Multi-source Data. ACM International Conference on the Foundations of Software Engineering (FSE), Industry Track. Porto de Galinhas, Brazil, July 15-19, 2024 (CCF A).
16. Shenglin Zhang, Yongxin Zhao, Xiao Xiong, Yongqian Sun*, Xiaohui Nie, Jiacheng Zhang, Fenglai Wang, Xian Zheng, Yuzhi Zhang, Dan Pei. Illuminating the Gray Zone: Non-Intrusive Gray Failure Localization in Server Operating Systems. ACM International Conference on the Foundations of Software Engineering (FSE), Industry Track. Porto de Galinhas, Brazil, July 15-19, 2024 (CCF A).
17. Sibo Xia, Minghua Ma, Pengxiang Jin, Liyue Cui, Shenglin Zhang*, Wa Jin, Yongqian Sun, Dan Pei. Response Time Anomaly Diagnosis for Search Service. Journal of Computer Research and Development, 2024, 61(6): 1573-1584 (in Chinese).
18. Zhaoyang Yu#, Shenglin Zhang#, Mingze Sun, Yingke Li, Zhaoyankai, Xiaolei Hua, Lin Zhu, Xidao Wen, Dan Pei*. Fine-Tuning for Unsupervised KPI Anomaly Detection for Mobile Web Systems. The Web Conference 2024. Singapore, May 13 – 17, 2024 (CCF A).
2022
1. Yongqian Sun, Daguo Cheng, Pengxiang Jin, Quan Ding, Shenglin Zhang*, Xu Chen, Yuzhi Zhang, Minghan Liang, Dan Pei, Jianyan Zheng, Sen Luo, Xinyu Tang. Robust Anomaly Clue Localization of Multi-dimensional Derived Measure for Online Video Services. IEEE Transactions on Services Computing. Accepted (CCF B, SCI indexed, CAS Tier 1, Impact Factor: 8.21).
2. Xianglin Lu, Zhe Xie, Zeyan Li, Mingjie Li, Xiaohui Nie, Nengwen Zhao, Qingyang Yu, Shenglin Zhang, Kaixin Sui, Lin Zhu and Dan Pei. Generic and Robust Performance Diagnosis via Causal Inference for OLTP Database systems. IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid), May 16-19, 2022 (CCF C).
3. Shenglin Zhang, Dongwen Li, Zhenyu Zhong, Jun Zhu, Minghan Liang, Jiexi Luo, Yongqian Sun*, Ya Su, Sibo Xia, Zhongyou Hu, Yuzhi Zhang, Dan Pei, Jiyan Sun and Yinlong Liu. Robust System Instance Clustering for Large-Scale Web Services. The Web Conference (WWW), Virtual Conference, April 25-29, 2022 (CCF A).
4. Xiaolei Hua, Lin Zhu, Shenglin Zhang, Zeyan Li, Su Wang, Dong Zhou, Shuo Wang, Chao Deng. GenAD: General Representations of Multivariate Time Series for Anomaly Detection. Artificial Intelligence for Cyber Security (AICS), AAAI-22 Workshop, Vancouver, BC, Canada, February 2022.
2021
1. Shenglin Zhang, Chenyu Zhao, Yicheng Sui, Ya Su*, Yongqian Sun, Yuzhi Zhang, Dan Pei, Yizhe Wang. “Robust KPI Anomaly Detection for Large-Scale Software Services with Partial Labels”. IEEE International Symposium on Software Reliability Engineering (ISSRE), October 25-28, 2021, Wuhan, China (CCF B).
2. Minghua Ma, Shenglin Zhang*, Junjie Chen, Haozhe Li, Yongliang Lin, Jim Xu, Xiaohui Nie, Bo Zhu, Yong Wang. “Jump-Starting Multivariate Time Series Anomaly Detection for Online Service Systems”. USENIX Annual Technical Conference (USENIX ATC), Virtual Conference, July 14-16, 2021 (CCF A).
3. Ya Su, Youjian Zhao, Ming Sun, Shenglin Zhang*, Xidao Wen, Yongsu Zhang, Xian Liu, Xiaozhou Liu, Junliang Tang, Wenfei Wu, Dan Pei. “Detecting Outlier Machine Instances through Gaussian Mixture Variational Autoencoder with One Dimensional CNN”. IEEE Transactions on Computers (TC) (CCF A, SCI indexed, Impact Factor: 2.711, CAS Tier 2).
4. Weibin Meng, Ying Liu, Shenglin Zhang*, Federico Zaiter, Yuzhe Zhang, Yuheng Huang, Zhaoyang Yu, Yuzhi Zhang, Lei Song, Ming Zhang, Dan Pei. “LogClass: Anomalous Log Identification and Classification with Partial Labels”. IEEE Transactions on Network and Service Management (TNSM), Volume 18, Issue 2, pp 1870-1884, June 2021 (SCI indexed, Impact Factor: 3.878, CAS Tier 2).
5. Ming Sun, Ya Su, Shenglin Zhang, Yuanpu Cao, Yuqing Liu, Dan Pei, Wenfei Wu, Yongsu Zhang, Xiaozhou Liu, Junliang Tang. “CTF: Anomaly Detection in High-Dimensional Time Series with Coarse-to-Fine Model Transfer”. IEEE International Conference on Computer Communications (INFOCOM) 2021, Virtual Conference, May 2021 (CCF A)
2020
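1. Jinshu Su, Baokang Zhao, Dezun Dong, Gaofeng Lv, Mei Wen, Liang Wei, Wei Peng, Fuliang Li, Shenglin Zhang, Yongqian Sun. Research Progress on Next-Generation Datacenter Network Technologies. In “CCF 2019-2020 China Computer Science and Technology Development Report”, China Machine Press, October 2020 (in Chinese).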
2. Rui Chen, Shenglin Zhang, Dongwen Li, Yuzhe Zhang, Fangrui Guo, Weibin Meng, Dan Pei, Yuzhi Zhang, Xu Chen, Yuqing Liu. "Cross-System Log Anomaly Detection for Software Systems". IEEE International Symposium on Software Reliability Engineering (ISSRE), Virtual Conference, October 2020 (CCF B).
3. Ping Liu, Haowen Xu, Qianyu Ouyang, Rui Jiao, Zhekang Chen, Xiaoying Bai, Shenglin Zhang, Jiahai Yang, Linlin Mo, Jice Zeng, Wenman Xue, Dan Pei. “Unsupervised Detection of Microservice Trace Anomalies through Service-Level Deep Bayesian Networks”. IEEE International Symposium on Software Reliability Engineering (ISSRE), Virtual Conference, October 2020 (CCF B).
4. Minghua Ma, Zheng Yin, Shenglin Zhang, Sheng Wang, Christopher Zheng, Xinhao Jiang, Hanwen Hu, Cheng Luo, Yilin Li, Nengjun Qiu, Feifei Li, Changcheng Chen, Dan Pei. “Diagnosing Root Causes of Intermittent Slow Queries in Cloud Databases”. International Conference on Very Large Data Bases (VLDB), Virtual Conference, August 2020 (CCF A).
5. Weibin Meng, Ying Liu, Federico Zaiter, Shenglin Zhang*, Yihao Chen, Yuzhe Zhang, Yichen Zhu, En Wang, Ruizhi Zhang, Shimin Tao, Dian Yang, Rong Zhou, Dan Pei. “LogParse: Making Log Parsing Adaptive through Word Classification”. IEEE International Conference on Computer Communications and Networks (ICCCN) 2020, Virtual Conference, August 3-6, 2020 (CCF C).
6. Weibin Meng, Ying Liu, Yuheng Huang, Shenglin Zhang*, Federico Zaiter, Bingjin Chen, Dan Pei. “A Semantic-aware Representation Framework for Online Log Analysis”. IEEE International Conference on Computer Communications and Networks (ICCCN) 2020, Virtual Conference, August 3-6, 2020 (CCF C).
7. Yuan Meng, Shenglin Zhang*, Yongqian Sun, Ruru Zhang, Zhilong Hu, Yiyin Zhang, Chenyang Jia, Zhaogang Wang, Dan Pei. “Localizing Failure Root Causes in a Microservice through Causality Inference”. International Symposium on Quality of Service (IWQoS), Virtual Conference, June 2020 (CCF B)
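8. Shenglin Zhang, Xiaofei Lin, Yongqian Sun, Yuzhi Zhang, Dan Pei. Unsupervised KPI Anomaly Detection Based on Deep Learning. Frontiers of Data and Computing (《数据与计算发展前沿》), 2(3): 87-100, June 2020 (invited paper, in Chinese).
9. Shenglin Zhang, Dongwen Li, Yongqian Sun, Weibin Meng, Yuzhe Zhang, Yuzhi Zhang, Ying Liu, Dan Pei. A Generic Anomaly Detection Mechanism for Multi-Syntax Logs in Cloud Datacenters. Journal of Computer Research and Development (《计算机研究与发展》), 57(4): 778-790, 2020 (in Chinese).
10. Shenglin Zhang, Ying Liu, Weibin Meng, Jiahao Bu, Sen Yang, Yongqian Sun*, Dan Pei, Jun Xu, Yuzhi Zhang, Lei Song, Ming Zhang. “Efficient and Robust Syslog Parsing for Network Devices in Datacenter Networks”. IEEE Access, Volume 8, pp 30245-30261, February 2020 (SCI indexed, Impact Factor: 4.098, CAS Tier 2).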
2019
1. Ping Liu, Yu Chen, Xiaohui Nie, Jing Zhu, Shenglin Zhang, Kaixin Sui, Ming Zhang, Dan Pei. “FluxRank: A Widely-Deployable Framework to Automatically Localizing Root Cause Machines for Software Service Failure Mitigation”. IEEE International Symposium on Software Reliability Engineering (ISSRE), Berlin, Germany, October 2019 (CCF B)
2. Weibin Meng, Ying Liu, Yichen Zhu, Shenglin Zhang*, Dan Pei, Yuqing Liu, Yihao Chen, Ruizhi Zhang, Shimin Tao, Pei Sun, Rong Zhou. “LogAnomaly: Unsupervised Detection of Sequential and Quantitative Anomalies in Unstructured Logs”. International Joint Conference on Artificial Intelligence (IJCAI), Macao, China, August 2019 (CCF A).
3. Yuan Meng, Shenglin Zhang*, Zijie Ye, Benliang Wang, Zhi Wang, Yongqian Sun, Qitong Liu, Shuai Wang, Dan Pei. “Causal Analysis of the Unsatisfying Experience in Realtime Mobile Multiplayer Games in the Wild”. IEEE International Conference on Multimedia and Expo (ICME), Shanghai, China, July 2019 (CCF B)
2018
1. Shenglin Zhang, Ying Liu, Dan Pei, Yu Chen, Xianping Qu, Shimin Tao, Zhi Zang, Xiaowei Jing, Mei Feng. “FUNNEL: Assessing Software Changes in Web-based Services”. IEEE Transactions on Services Computing, Volume 11, Issue 1, January-February 2018 (SCI indexed, Impact Factor: 5.82, CAS Tier 2)
2. Yongqian Sun, Youjian Zhao, Ya Su, Dapeng Liu, Xiaohui Nie, Yuan Meng, Shiwen Cheng, Dan Pei*, Shenglin Zhang, Xianping Qu, Xuanyou Guo. “HotSpot: Anomaly Localization for Additive KPIs With Multi-Dimensional Attributes”. IEEE Access, Volume 6, pp. 10909-10923, February 2018 (SCI indexed, Impact Factor: 4.098, CAS Tier 2)
3. Minghua Ma, Shenglin Zhang*, Dan Pei, Xin Huang, Hongwei Dai. “Robust and Rapid Adaption for Concept Drift in Software System Anomaly Detection”. IEEE International Symposium on Software Reliability Engineering (ISSRE), Memphis, TN, USA, October 2018 (Best Research Paper Award, CCF B)
4. Shenglin Zhang, Ying Liu, Weibin Meng, Zhiling Luo, Jiahao Bu, Sen Yang, Peixian Liang, Dan Pei, Jun (Jim) Xu, Yuzhi Zhang, Yu Chen, Hui Dong, Xianping Qu, Lei Song. “PreFix: Switch Failure Prediction in Datacenter Networks”. ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2018, Irvine, California, USA, June 18-22, 2018 (CCF B, acceptance rate 20%, 54/270, one of only two papers from institutions in mainland China)
5. Weibin Meng, Ying Liu, Shenglin Zhang*, Dan Pei, Hui Dong, Lei Song, Xulong Luo. “Device-Agnostic Log Anomaly Classification with Partial Labels”. IEEE/ACM International Symposium on Quality of Service (IWQoS) 2018, Banff, Alberta, Canada, June 2018 (CCF B)
6. Jiahao Bu, Ying Liu, Shenglin Zhang*, Weibin Meng, Qitong Liu, Xiaotian Zhu, Dan Pei. “Rapid Deployment of Anomaly Detection Models for Large Number of Emerging KPI Streams”. International Performance Computing and Communications Conference (IPCCC), Orlando, Florida, USA, November 2018 (CCF C)
7. Shenglin Zhang, Ying Liu, Dan Pei, and Baojun Liu. “Measuring BGP AS Path Looping (BAPL) and Private AS Number Leaking (PANL)”. Journal of Tsinghua University (Science and Technology), Volume 23, Number 1, pp 22-34, February 2018 (SCI indexed, IF 1.328)
2017
1. Dan Pei, Shenglin Zhang, Changhua Pei. “AIOps Based on Machine Learning” (《基于机器学习的智能运维》). Communications of the CCF, column article, 2017, Issue 12 (in Chinese)
2. Shenglin Zhang, Weibin Meng, Jiahao Bu, Sen Yang, Ying Liu, Dan Pei, Jun (Jim) Xu, Yu Chen, Hui Dong, Xianping Qu, Lei Song. “Syslog Processing for Switch Failure Diagnosis and Prediction in Datacenter Networks”. IEEE/ACM International Symposium on Quality of Service (IWQoS) 2017, Vilanova i la Geltrú, Spain, June 2017 (CCF B)
2016 and earlier
1. Shenglin Zhang, Ying Liu, Dan Pei, Yu Chen, Xianping Qu, Shimin Tao, and Zhi Zang. “Rapid and Robust Impact Assessment of Software Changes in Large Internet-based Services”. ACM International Conference on emerging Networking EXperiments and Technologies (CoNEXT), Heidelberg, Germany, December 2015, 13 pages (CCF B)
2. Ying Liu, Shenglin Zhang*, Hongying Liu. “A bottleneck-free model for P4P”. SCIENCE CHINA Information Sciences, Volume 58, Issue 10, pp 1-15, October 2015 (SCI indexed, Impact Factor: 3.304, CAS Tier 2)
3. Ying Liu, Gang Ren, Jianping Wu, Shenglin Zhang, Lin He, Yihao Jia. “Building An IPv6 Address Generation and Traceback System With NIDTGA in Address Driven Network”. SCIENCE CHINA Information Sciences, Volume 58, Issue 12, pp 1-14, December 2015 (SCI indexed, Impact Factor: 3.304, CAS Tier 2)
4. Shenglin Zhang, Ying Liu, Dan Pei. “A Measurement Study on BGP AS Path Looping Behavior”. IEEE International Conference on Computer Communications and Networks (ICCCN), Shanghai, China, August 4, 2014, 7 pages (CCF C)
5. Ying Liu, Shenglin Zhang*, Hongying Liu. “An Improved Cooperative Model of P2P and ISP”. Frontiers in Internet Technologies, 85-96, LNCS, Springer, 2013 (EI indexed)
6. Shenglin Zhang, Ying Liu. Research on AS Path Loops. Journal on Communications (《通信学报》), 2013, (Z2): 17-22 (EI indexed, in Chinese)
Courses Taught
Computer Networks
Software Testing and Maintenance
Frontiers of IT Operations
Computer Algorithm Design and Analysis
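Professional Service
1. Standing committee member, CCF Technical Committee on Internet
2. Executive committee member, CCF Technical Committee on Software Engineering
3. Executive committee member, CCF Technical Committee on Services Computing
4. Vice Chair, CCF YOCSEF Tianjin (2023-2024)
5. Artifact Evaluation Co-Chair of ISSRE 2024
6. TPC member of FSE 2025 industry track
7. TPC member of KDD 2025 ADS track
8. TPC member of WSDM 2023/2024/2025
9. TPC member of IEEE/ACM IWQoS 2022/2023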