SENIOR MANAGER, ADVANCED RESEARCH COMPUTING (ARC) | |
Job ID | 66025-5252 |
Company | McMaster University |
Location | Hamilton |
Province | ON |
Date Posted | 2024-11-06 |
Job Type | Full-time |
Job Category | Education |
Description | |
Reporting to the Director, Research and High-Performance Computing Support (RHPCS), the Senior Manager - Advanced Research Computing (ARC) plays a critical leadership role in the planning and delivery of advanced research computing and digital research infrastructure and services to meet the diverse needs of researchers across all Faculties and Institutes. This position leads a team of IT and research technology specialists who provide comprehensive support throughout all phases of service development, including consultation, definition, design, planning, construction, testing, delivery, and ongoing support. Working within the larger McMaster IT ecosystem, the team contributes to the model of coordinated decentralization and delivers the most appropriate technology solutions based on the specific needs of each researcher, drawing on the IT service opportunities across campus, as well as implementing and delivering managed digital research infrastructure and services. The Senior Manager also has expert knowledge of the national and provincial research ecosystem and can access and integrate the various strategic opportunities available to support our researchers. This position offers a unique opportunity to advance and support research across all academic disciplines at a top research-intensive U15 university. The successful candidate will significantly impact the research community both regionally and nationally, contributing to the university’s mission of fostering cutting-edge research. This position provides strategic technical leadership for prioritizing and delivering highly effective ARC systems, ensuring that McMaster’s investment in advanced research computing systems meets the technical needs of researchers across all Faculties.
JOB SPECIFIC ACCOUNTABILITIES
Team Leadership:
Technical Expertise and Management:
Stakeholder Engagement:
Research Computing Infrastructure:
Grant Support and System Management:
Strategic Vision:
CORE COMPETENCIES
COMMUNICATING: Exchanges organizational knowledge clearly, frequently, and consistently; communicates plans and decisions necessary for meaningful participation and optimal work by team members, colleagues, stakeholders, other IT professionals, and leadership; listens actively, with insight and respect for others' perspectives; perceives and reacts sensitively to others' nonverbal cues; builds relationships and attracts support; gives recognition to groups and individuals who contribute to the University's achievement and overall goals and objectives.
DEVELOPING PEOPLE: Advocates and enables continuous learning opportunities for all employees; encourages employee involvement in decision-making as an integral part of the developmental process; is active in self-development; as a key expert, remains current with industry trends and best practices; creates a strong culture and environment of respect, kindness, equity, and inclusion.
LEADERSHIP: Known for achievement, and recognizes that achievement requires leadership and the participation of others; recruits the right people; engages team members, colleagues, stakeholders, and other IT professionals in efforts to achieve the vision of the University; solves problems that arise; evaluates progress towards goals and objectives; negotiates resolutions to conflicts; protects/heals the organization during times of change; integrates stakeholders, colleagues, IT professionals, and team members to achieve synergies; promotes and supports a culture of health, safety, well-being, and respect and ensures compliance with related policies and legislation to sustain a healthy work environment.
EMBRACING CHANGE: Exhibits a passion for building creativity, innovation, and excellence at the University; works effectively with ambiguity and complex issues, while searching for innovations and improvements to make tomorrow's workplace better than today's; identifies, recommends, and implements new information security service opportunities in the functional areas of responsibilities of this role, as well as opportunities to improve existing information security solutions and processes; continually improves related systems and processes.
OPTIMIZING SERVICE RELATIONSHIPS: Identifies, builds, and manages relationships which are service-oriented and constituent-centered; systematically creates positive current and future benefits for the University; builds effective relationships, internally and externally, to increase effectiveness in conducting assigned duties.
PLANNING: Demonstrates initiative and effective planning and organization skills to meet deadlines and complete quality work; demonstrates understanding of higher education issues and fiscal awareness through course of work and contributes to fiscal responsibility, efficiency, and restraint.
TECHNICAL COMPETENCIES:
ADVANCED RESEARCH COMPUTING (ARC) Infrastructure and Technology: Expert in implementing and managing advanced research computing technologies, including HPC clusters, parallel processing, and large-scale data management. Demonstrates in-depth knowledge of operating systems (Linux, Unix, Windows), system architecture, and cross-platform interoperability. Proficient in cloud-based ARC environments and hybrid models, as well as containerization and orchestration tools to support research workflows.
Strategic Focused Delivery: Brings a strategic focus to all aspects of systems and services under the direction of the Director, Research and High-Performance Computing Support. Develops strategic objectives, targets, and continuous improvement initiatives for ARC functions, ensuring alignment with institutional goals and research trends. Anticipates future infrastructure needs, staying abreast of emerging technologies, such as next-generation computing architectures, artificial intelligence, and cloud-based ARC solutions. Proactively evaluates and integrates these advancements to evolve the ARC ecosystem and support innovative research capabilities.
Business Continuity/Disaster Recovery: Identifies, develops, and implements comprehensive business continuity and disaster recovery plans. Ensures the continuous delivery of critical advanced research computing services and the protection of data and infrastructure in case of disruptions. Leads the design and execution of backup strategies, replication solutions, and recovery processes to safeguard research data.
Documentation and Reporting: Maintains comprehensive, up-to-date documentation for ARC infrastructure, including system configurations, procedures, policies, and security protocols. Generates automated reports using dashboard tools to provide real-time insights into system performance, resource utilization, and project status for leadership and research stakeholders. Ensures compliance with institutional and external reporting requirements, particularly related to research data privacy and security.
Financial Management: Contributes to the budgeting process for the RHPCS department by providing in-depth technical insight into current and future ARC infrastructure needs. Assists with forecasting, cost estimation, and resource allocation to optimize spending and ensure alignment with grant funding and institutional priorities.
Portfolio Management: Leads the planning, organization, and execution of advanced research computing projects, ensuring alignment with institutional research goals and priorities. Applies risk management techniques to assess project feasibility, mitigate issues, and deliver on objectives. Collaborates with researchers and stakeholders to ensure projects meet scientific and technical requirements.
Quality Assurance: Designs, develops, and implements quality assurance processes for ARC infrastructure, including benchmarking and stress testing of computing and storage systems. Conducts system audits and performance reviews to ensure ongoing quality, reliability, and optimal resource utilization. Ensures research computing environments meet rigorous standards for scalability, robustness, and compliance.
Security and Compliance: Implements security protocols to protect advanced research computing infrastructure from unauthorized access, data breaches, and cyber threats. Ensures compliance with institutional data privacy standards and research data protection policies. Collaborates with cybersecurity experts to maintain the highest standards of security and confidentiality across research computing systems.
Strategy / Policy: Provides expert input to inform policies governing research computing, including security, data retention, backup, and service scope. Ensures compliance with institutional policies on research infrastructure while advocating for ARC investments that align with McMaster’s long-term goals. Drives the adoption of sustainable computing practices, optimizing the environmental and financial impact of advanced research computing infrastructure.
Supervise / Manage Teams: Leads a team of research computing professionals, overseeing their training, development, workload management, and performance evaluation. Demonstrates a strong ability to manage distributed teams, fostering collaboration and accountability. Builds a high-performing team culture that ensures the consistent delivery of exceptional research computing support and services.
System Optimization: Continuously identifies and implements performance-tuning measures for computing, storage, and network systems. Conducts regular system audits, applying industry best practices to maximize resource utilization, improve energy efficiency, and optimize system performance for complex research workflows. Proficient in profiling and optimizing parallel computing applications.
Vendor and Stakeholder Management: Leverages experience with technology vendors and external stakeholders to achieve business outcomes. Provides technical leadership in negotiating service-level agreements (SLAs), specifying technical requirements, and managing vendor contracts. Ensures timely, cost-effective delivery of ARC services by monitoring vendor performance and maintaining strong relationships to meet institutional research goals.
Service Management: Ensures the reliability, scalability, and performance of advanced research computing infrastructure, including high-performance computing systems, data storage, and high-bandwidth networking. Applies the ITIL framework (Incident, Change, and Problem Management), capacity planning, and automated provisioning strategies to maintain operational efficiency. Proficient with monitoring and alerting tools to ensure system uptime, resource optimization, and proactive incident response.
Discretion: Handles sensitive and confidential information, including research data, institutional policies, and proprietary technologies. Ensures strict adherence to data privacy regulations and confidentiality protocols, always safeguarding institutional and researcher data.
Planning and Implementation of Systems: Assists faculty and research groups in the planning, design, and implementation of advanced computing installations. Provides detailed design specifications for ARC infrastructure, hardware, and software configurations. Ensures that installations align with institutional research priorities and are completed on time and within budget.
Liaises: Liaises with other university departments, stakeholders, and external partners to ensure the successful delivery of advanced research computing services. Provides technical input for grant applications, helping researchers secure funding for computational resources. Regularly interacts with researchers to balance competing demands and ensure high service quality.
Research: Conducts research on emerging high-performance computing methodologies and technologies, evaluating their efficacy for specific scientific programs. Stays informed of the latest developments in advanced research computing, contributing to knowledge-sharing and innovation within the department.
SUPERVISION
EDUCATION, QUALIFICATIONS, AND CERTIFICATIONS
Personal Attributes
Core Business Interpersonal Attributes
Core Interpersonal Attributes
Work Conditions