MD Anderson Cancer Center Hadoop Administrator - Institutional Analytics & Informatics in Houston, Texas
Institutional Analytics & Informatics Vision:
Enable informatics and analytics across service lines by providing the right information to the right people at the right time with the right tools. The Institutional Analytics & Informatics team is focused on building the infrastructure and creating the capabilities necessary for sustainable clinical data delivery. With ongoing changes in the national health care environment, data will be increasingly required to optimize medical practice. Effective health care delivery must be of high quality and at a reasonable cost. Enterprise-wide vocabularies, data modeling, business intelligence solutions and natural language processing are key enabling technologies for clinical information management. We are the enterprise data architects and infrastructure experts.
The Hadoop Administrator is responsible for the Hadoop architecture, design, administration and maintenance, as well as backup and recovery processes. This person is also responsible for troubleshooting, assisting with testing, performance tuning and documentation across the multiple environments. Strong communication and customer service skills are needed to work with a variety of customers, peers and management. Must be able to multitask and handle changing priorities.
- Hadoop Architecture
Participates and operates as a lead to review business requirements, architecture design and proposed solutions in accordance with our Hadoop standards and requirements. Tasks include Hadoop architecture and design, security design, Kerberos, High Availability configuration and disaster recovery plans. Provides strategic vision and direction for the Hadoop environment. Advises on proper installation, configuration, maintenance, security and monitoring of the Hadoop environment. Expert-level experience with the Hortonworks distribution of Hadoop and its architecture.
- Hadoop Development and Design
Leads and/or mentors Hadoop development and design in accordance with departmental and institutional standards and guidelines for ingesting, transforming, governing and analyzing data. Development tasks include but are not limited to coding, troubleshooting, testing, documenting and enforcing Hadoop standards according to MD Anderson Information Services policies, procedures and best practices. Conducts reviews of other integration administrators' efforts to ensure consistent methodologies and standards, and makes recommendations where necessary. Efforts include job scheduling, custom code development, performance tuning, Hadoop component installation and configuration, design and development of schemas and models in the Hadoop components, and coordinating activities between internal teams, customers and other departments. Previous experience with Hortonworks DataFlow, NiFi and other streaming and batch integration components for loading data into HDFS, Hive and HBase.
Experience working with Data Scientists on tuning Spark jobs.
- Hadoop Production Support
Actively leads and assists in the installation, configuration, monitoring, maintenance and administration of the Hadoop environment using Ambari. Deep understanding of the Linux and Unix operating systems, including CPU and memory behavior. Provides after-hours (on-call) support as needed for daily, weekly and monthly processes. Responsible for system availability, performance, code migration, backups, disaster recovery and security. Thorough understanding of Kerberos, Ranger and Knox security.
Seven years of integration programming and/or systems level experience. May substitute required education degree with additional years of equivalent experience on a one-to-one basis or completed years of college on a one-to-one basis.
Nine years of integration programming and/or systems level experience.
Onsite Presence: Is Required
It is the policy of The University of Texas MD Anderson Cancer Center to provide equal employment opportunity without regard to race, color, religion, age, national origin, sex, gender, sexual orientation, gender identity/expression, disability, protected veteran status, genetic information, or any other basis protected by institutional policy or by federal, state or local laws unless such distinction is required by law. http://www.mdanderson.org/about-us/legal-and-policy/legal-statements/eeo-affirmative-action.html