Duties and responsibilities 工作职责:
1. Cooperation with developers on implementation of new versions of applications on K8s/VMs;
与开发人员合作在K8S/VM上发布各类新版本的应用程序;
2. Continuously optimize and improve infrastructure to support all business needs, Independently and efficiently to handle complex problems,
不断对基础设施进行优化和完善,为所有业务需求提供支持,具备先进的故障排除和复杂问题的解决能力;
3. Daily operations of Linux ba
基于Linux的监控系统Nagios/Zabbix/Prometheus进行新功能/新需求的研究与落地,特别是在操作系统层;
4. Responsible for the proper daily operations of the Linux systems including install, maintain and patch/upgrade Linux OS and various application software;
负责Linux系统的日常正常运行,包括安装、维护和补丁/升级Linux操作系统及各种应用软件;
5. Monitoring everyday systems and evaluate the availability of all server resources and perform all activities, Logging and Alerting, ensuring systems and applications are always up and running, using various automation tools;
使用各种自动化工具监视日常系统并评估所有服务器资源的可用性,执行所有活动、日志记录和警报,确保系统和应用程序始终处于正常运行状态;
6. Developing constantly documentation regarding configurations, operations and troubleshooting procedures related to the Linux platform all systems;
不断开发与Linux平台所有系统相关的配置、操作和故障排除过程相关的文档;
7.Participate to the definition of standards, guidelines, best practices and metrics as directed;
按照指示参与标准、指导方针、最佳实践和度量标准的定义;
8. Collaborating heavily with other team members, solving problems together. Assist in evaluating new requirements, technical design and standards as project or operation required.
与其他团队成员紧密合作,共同解决问题。根据项目或运营需要,协助评估新需求、技术设计和标准。
任职资格:
1. Bachelor degree or above (Computer related major is preferred), 4 years of experience in Linux systems administration in a 200 server’s environment;
本科及以上学历(计算机相关专业优先),具备4年以上中等规模(200台以上) Linux服务器环境管理经验;
2. Skilled in basic Linux systems management, such as server installation& system optimization, LVM, bash, yum/rpm, grub, file privilege;
熟练掌握Linux系统基本原理及操作,如服务器安装与系统优化、LVM、bash、yum/rpm、grub、文件权限等;
3. Solid experience with in configuration management tools like Ansible & Puppet, skilled with monitoring system Nagios/Zabbix/Prometheus;
熟悉 Ansible & Puppet等集群配置管理工具,熟练使用Nagios/Zabbix/ Prometheus监控系统;
4. Experiences strongly in service management, such as DNS, SFTP, NFS, SMTP, SAMBA etc. Skilled in any one of Shell/Python/Perl/Go/Ruby.
对DNS、SFTP、NFS、SMTP、SAMBA等服务具有丰富的管理经验,熟练使用至少一种编程或脚本语言 Shell/Python/Perl/Go/Ruby/Java;
5. Skilled with web service Apache/Nginx/Tomcat,as much as CA , Experience in one of middleware management such as RabbixMQ, Redis, Kafka;
熟练使用web服务Apache/Nginx/Tomcat以及CA, 有RabbixMQ, Redis、Kafka等中间件管理经验;
6. Skilled with one of HA/LB solution, such as keepallived/nginx /haproxy, Knowledge in transfer protocol, like TCP/IP, http, https and network module like OSI, TCP/IP, router etc;
熟练使用HA/LB解决方案,如keepalalive /nginx /haproxy,熟悉TCP/IP、http、https等传输协议,熟悉OSI、TCP/IP、路由器等网络模块。
7. Knowledge in DevOps & CI/CD, such as Svn/Git/Jenkins & Knowledge in ELK is preferred,
Skill in developing techniques and methodologies to resolve unprecedented problems or situations.
具有Devops理念,掌握CI/CD相关设计和软件,如Jenkins等,具备开发能力去解决现有的问题;
8. Familiar with k8s and its ecosystem components, Knowledge in ELK is preferred.
了解k8s及其生态圈,有elk相关经验优先。
Staff Welfare 员工福利:
1. After on boarding, providing supplementary commercial insurance besides the state-stipulated insurance and housing fund;
入职即为员工缴纳六险一金;
2. Fast and transparent promotion channel and broad development opportunities;
快速透明的晋升渠道,广阔的发展空间;
3. 5-15 days paid annual leave and 5 days paid sick leave. Free annual physical check-up;
每年5-15天带薪年休假,5天带薪病假,年度免费员工体检;
4. Attractive employee activities.
有吸引力的员工活动。