Job DescriptionStrength Through DiversityGround breaking science. Advancing medicine. Healing made personal.Roles & Responsibilities: The Scientific Computing and Data group at the Icahn School of Medicine at Mount Sinai partners with scientists to accelerate scientific discovery. To achieve these aims, we support a cutting-edge high-performance computing and data ecosystem along with MD/PhD-level support for researchers. The group is composed of a high-performance computing team, the research clinical data warehouse team and a research data services team. The Lead HPC Architect, High Performance Computational and Data Ecosystem, is responsible for architecting, designing, and leading the technical operations for Scientific Computing's computational and data science ecosystem. This ecosystem includes high-performance computing (HPC) systems, clinical research databases, and a software development infrastructure for local and national projects. To meet Sinai's scientific and clinical goals, the Lead brings a strategic, tactical and customer-focused vision to evolve Sinai's computational and data-rich environment to be continually more resilient, scalable and productive for basic and translational biomedical research. The development and execution of the vision includes a deep technical understanding of the best practices for computational, data and software development systems along with a strong focus on customer service for researchers. The Lead is an expert troubleshooter and productive team member. The incumbent is a productive partner for researchers and technologists throughout the organization and beyond. This position reports to the Director for Computational & Data Ecosystem in Scientific Computing. Responsibilities
- Lead the technical operations including the architect, design, expansion, monitoring, support, and maintenance for Scientific Computing's computational and data science ecosystem consistent with best practices. Key components include a 50,000+ core and 30+ petabyte usable high-performance computing cluster, clinical data warehouse and software development environment.
- Lead the troubleshooting, isolation and resolution of all technical issues
- Lead the design, development, implementation and management of all system administration tasks, including hardware and software configuration, configuration management, system monitoring (including the development and maintenance of regression tests), usage reporting, system performance (file systems, scheduler, interconnect, high availability, etc.), security, networking and metrics, etc.
- Ensures that the design and operation of the HPC ecosystem is productive for research.
- Collaborates effectively with research and hospital system IT, compliance, HIPAA, security and other departments to ensure compliance with all regulations and Sinai policies.
- Partners with other peers regionally, nationally and internationally to discover, propose and deploy a world-class research infrastructure for Mount Sinai.
- Prepares and manages budgets for hardware, software and maintenance. Participates in chargeback/fee recovery analysis and provides suggestions to make operations sustainable.
- Lead the integration of HPC resources with laboratory equipment such as genomic sequencers, etc.
- Researches, deploys and optimizes resource management and scheduling software and policies and actively monitoring.
- Designs, tunes, manages and upgrades parallel file systems, storage and data-oriented resources.
- Researches, deploys and manages security infrastructure, including development of policies and procedures.
- Lead and assist the team to resolve user support requests from researchers.
- Assists in developing and writing system design for research proposals.
- Lead the development of a framework for effective system documentation.
- Works effectively and productively with other team members within the group and across Mount Sinai.
- Provide after-hours support in case of a critical system issue.
Qualifications
- Bachelor's degree in computer science, engineering or another scientific field. Master's or PhD preferred.
- 8 years of progressive HPC system administration and operations (preferably in a Redhat/CentOS Linux administration, Batch HPC cluster environment)
- Must be an expert troubleshooter; Must be a team player and customer focused
- Strong experience with configuration management systems such as xCAT, Puppet and/or Ansible
- Strong experience with networking and security
- Strong experience with Infiniband and Gigabit Ethernet
- Experience with LSF and GPFS Spectrum Scale parallel file systems and storage
- Experience with providing technical operations leadership
- Ability to manage a variety of disparate tasks and priorities independently and troubleshoot complex technology problems.
- Attention to detail; time and project management skills.
- Excellent communication skills, analytical ability, strong judgment and management skills, and the ability to work effectively as a liaison between both research and technology teams.
- Strong written, oral, and interpersonal communication skills
- Script and programming experience
Preferred Experience
- Experience with archival storage and tape libraries (TSM) is highly preferred.
- Experience with databases and web services is highly preferred.
- Compliance, HIPAA, GDPR, FISMA
- Experience with managing web access to HPC resources (such as Open OnDemand)
- Experience in a research environment is highly preferred.
- Experience with financial budgets and providing cost benefit analysis is preferred.
- Cloud Technology
About Us
Strength Through Diversity The Mount Sinai Health System believes that diversity, equity, and inclusion are key drivers for excellence. We share a common devotion to delivering exceptional patient care. When you join us, you become a part of Mount Sinai's unrivaled record of achievement, education, and advancement as we revolutionize medicine together. We invite you to participate actively as a part of the Mount Sinai Health System team by:
- Using a lens of equity in all aspects of patient care delivery, education, and research to promote policies and practices to allow opportunities for all to thrive and reach their potential.
- Serving as a role model confronting racist, sexist, or other inappropriate actions by speaking up, challenging exclusionary organizational practices, and standing side-by-side in support of colleagues who experience discrimination.
- Inspiring and fostering an environment of anti-racist behaviors among and between departments and co-workers.
At Mount Sinai, our leaders strive to learn, empower others, and embrace change to further advance equity and improve the well-being of staff, patients, and the organization. We expect our leaders to embrace anti-racism, create a collaborative and respectful environment, and constructively disrupt the status quo to improve the system and enhance care for our patients. We work hard to create an inclusive, welcoming and nurturing work environment where all feel they are valued, belong and are able to advance professionally. Explore more about this opportunity and how you can help us write a new chapter in our history!
About the Mount Sinai Health System: Mount Sinai Health System is one of the largest academic medical systems in the New York metro area, with more than 43,000 employees working across eight hospitals, more than 400 outpatient practices, more than 300 labs, a school of nursing, and a leading school of medicine and graduate education. Mount Sinai advances health for all people, everywhere, by taking on the most complex health care challenges of our time - discovering and applying new scientific learning and knowledge; developing safer, more effective treatments; educating the next generation of medical leaders and innovators; and supporting local communities by delivering high-quality care to all who need it. Through the integration of its hospitals, labs, and schools, Mount Sinai offers comprehensive health care solutions from birth through geriatrics, leveraging innovative approaches such as artificial intelligence and informatics while keeping patients' medical and emotional needs at the center of all treatment. The Health System includes approximately 7,400 primary and specialty care physicians; 13 joint-venture outpatient surgery centers throughout the five boroughs of New York City, Westchester, Long Island, and Florida; and more than 30 affiliated community health centers. We are consistently ranked by U.S. News & World Report's Best Hospitals, receiving high Honor Roll status, and are highly ranked: No. 1 in Geriatrics and top 20 in Cardiology/Heart Surgery, Diabetes/Endocrinology, Gastroenterology/GI Surgery, Neurology/Neurosurgery, Orthopedics, Pulmonology/Lung Surgery, Rehabilitation, and Urology. New York Eye and Ear Infirmary of Mount Sinai is ranked No. 12 in Ophthalmology. U.S. News & World Report's Best Children's Hospitals ranks Mount Sinai Kravis Children's Hospital among the country's best in several pediatric specialties. The Icahn School of Medicine at Mount Sinai is ranked No. 14 nationwide in National Institutes of Health funding and in the 99th percentile in research dollars per investigator according to the Association of American Medical Colleges. Newsweek's The World's Best Smart Hospitals ranks The Mount Sinai Hospital as No. 1 in New York and in the top five globally, and Mount Sinai Morningside in the top 20 globally. The Mount Sinai Health System is an equal opportunity employer. We comply with applicable Federal civil rights laws and does not discriminate, exclude, or treat people differently on the basis of race, color, national origin, age, religion, disability, sex, sexual orientation, gender identity, or gender expression. We are passionately committed to addressing racism and its effects on our faculty, staff, students, trainees, patients, visitors, and the communities we serve. Our goal is for Mount Sinai to become an anti-racist health care and learning institution that intentionally addresses structural racism.
EOE Minorities/Women/Disabled/Veterans