IBM IaaS Monitoring DevOps Engineer in Austin, Texas


Software Developers at IBM are the backbone of our strategic initiatives to design, code, test, and provide industry-leading solutions that make the world run today - planes and trains take off on time, bank transactions complete in the blink of an eye and the world remains safe because of the work our software developers do. Whether you are working on projects internally or for a client, software development is critical to the success of IBM and our clients worldwide. At IBM, you will use the latest software development tools, techniques and approaches and work with leading minds in the industry to build solutions you can be proud of.

Your Role and Responsibilities


How can we effectively design and deliver for a large-scale, highly distributed cloud infrastructure? We are looking for an individual to join a team that designs and implements the back-end infrastructure supporting the IBM Cloud. The job provides the opportunity to be a key part of a team delivering the networks, infrastructure, and services for a world-class cloud.

  • Implement and administer infrastructure and solutions that support the IBM Cloud

  • Support the compliance and security integrity of the environment through your work

  • Partner with other teams, functional managers and program managers to deliver mission-critical services to the market

  • Support development of new capabilities and enhancement of existing ones for our compute, storage and network services

  • Provide technical escalation support for other Infrastructure Operations teams

  • Design, implement, manage and create a reliable, highly performant monitoring and alerting framework with dashboards, analytics, and correlation across IaaS

  • Work with and adopt open source technologies as well as participate in new IBM innovations, not just around monitoring, alerting, dashboards and root cause analysis, but across IaaS

  • Work towards a more autonomous root cause analysis system which deduplicates alerts and provides for a comprehensive single pane of glass monitoring infrastructure

  • Bring a self-driven attitude to proposing, testing and implementing solutions and improvements for review and consideration with your peers

Required Technical and Professional Expertise

  • 5+ years of experience in data center infrastructure or relevant work experience

  • 5+ years of experience in large-scale infrastructure design, engineering, and support

  • 5+ years of experience in IT Change, Incident, Problem, Asset management

  • 5+ years of infrastructure engineering with proven record for delivering high-quality, large-scale solutions. Experience designing architectures for scale and performance

  • 5+ years of practical experience with one or more operating systems: Ubuntu (preferred), CentOS, RHEL, Debian, or Windows Server

  • 5+ years of experience debugging issues across a Linux environment with network, storage, compute and orchestration components. Does not need to be code debugging.

  • 2+ years of extensive experience with Monitoring technologies: Zabbix (preferred), Grafana, Nagios, Zenoss, ELK, Splunk, etc.

  • 2+ years of experience with one or more virtualization technologies: Citrix Xen Hypervisor (preferred), KVM (also preferred), libvirt, QEMU, VMware vSphere, etc.

  • 2+ years of experience with one or more automation and configuration management tools/solutions: Ansible (preferred), Salt, Chef, Python, Bash, Puppet, Rundeck, etc.

  • 2+ years of experience with version control systems: GitHub (preferred), GitLab, Subversion, etc.

  • Experience with one or more programming languages: PowerShell, Python, or Ruby

  • Practical experience with orchestration that uses desired state models and/or finite state machine models of orchestration: Kubernetes (preferred), OpenShift, etc.

  • Practical experience with containerization and container orchestration: Docker (preferred), Kubernetes (preferred), OpenShift, Rancher, Docker Swarm, Docker Compose

  • 2+ years of at least basic experience with databases, both RDBMSs such as MySQL or PostgreSQL and other data stores and storage engines such as etcd, TimescaleDB, InnoDB, etc. Not a DBA role.

  • Working knowledge with Network and Storage technologies

  • Working knowledge with ServiceNow, JIRA, Confluence, and GitHub

  • ITIL Foundation V4 certification is a plus

Preferred Technical and Professional Expertise

  • Excellent verbal and written communication skills

  • Highly responsible, motivated, able to work with little direction

  • Experience with design and development of complex systems

  • Ability to troubleshoot complex problems and customer issues

  • Working knowledge of Linux clustering, HA, and fault-tolerant system implementations: active/active, active/passive, Pacemaker, keepalived, HAProxy, Corosync, LVM

  • 2+ years of experience with complex systems and layered architecture models: OSI, Kubernetes, virtualization, TCP/IP, etc.

  • Working knowledge of TCP/IP, BGP, sockets, routing protocols, routes, and keepalived, and of how they factor into debugging and highly available systems at scale

  • Ability to debug an issue across the entire OSI stack of a typical Linux environment across storage, network, compute, OS, system tuning, orchestration.

  • Ability to follow stack traces to particular libraries in code and identify root causes

  • Working knowledge of a message bus and message queues: Kafka (preferred), Spark, RabbitMQ, Redis, etc.

  • Extensive experience with databases and debugging their usage with application stacks

  • Experience with and understanding of the interaction and dependencies of a typical three tier model of application stacks, as well as cloud.


About Business Unit

Digitization is accelerating the ongoing evolution of business, and clouds - public, private, and hybrid - enable companies to extend their existing infrastructure and integrate across systems. IBM Cloud provides the security, control, and visibility that our clients have come to expect. We are working to provide the right tools and environment to combine all of our clients' data, no matter where it resides, to respond to changing market dynamics.

Your Life @ IBM

Are you craving to learn more? Prepared to solve some of the world's most unique challenges? And ready to shape the future for millions of people? If so, then it's time to join us, express your individuality, unleash your curiosity and discover new possibilities.

Every IBMer, and potential ones like yourself, has a voice, carves their own path, and uses their expertise to help co-create and add to our story. Together, we have the power to make meaningful change - to alter the fabric of our clients, of society and IBM itself, to create a truly positive impact and make the world work better for everyone.

It's time to define your career.

About IBM

IBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.

Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we're also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business. At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it's time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.

Location Statement

IBM offers a wide range of resources for eligible IBMers to thrive both inside and outside of work. In addition to a competitive benefits program consisting of medical and life insurance, retirement plans, and time off, eligible employees may also have access to:

  • 12 weeks of paid parental bonding leave. Family care options are also available to support eligible employees during COVID-19.

  • World-class training and educational resources on our personalized, AI-driven learning platform. IBM's learning culture supports your restless attitude to grow your skills and build the depth and scale of knowledge needed to achieve your career goals.

  • Well-being programs to support mental and physical health.

  • Financial programs that empower you to plan, save, and manage your money (including expert financial counseling, 401(k), IBM stock discount, etc.).

  • Select educational reimbursement opportunities.

  • Diverse and inclusive employee resource groups where you can network and connect with IBMers across the globe.

  • Giving and volunteer programs to benefit charitable organizations and local communities.

  • Discounts on retail products, services, and experiences.

We consider qualified applicants with criminal histories, consistent with applicable law.

Being You @ IBM

IBM is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, pregnancy, disability, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.