Post Job Free
Sign in

Solution Architect Big Data

Location:
Pune, Maharashtra, India
Posted:
September 09, 2025

Contact this candidate

Resume:

Tushar Pathare

Solution Architect Cloud - High Performance Computational and Data Ecosystem

Seeking a challenging Solution Architect at your company leveraging 16+ years of experience in high-performance computing, big data, and solution architecture to design, optimize, and secure large-scale computational and data ecosystems. Expertise in IBM Spectrum Scale, cloud technologies, and cybersecurity, coupled with a proven track record of leading complex deployments, managing large datasets, and ensuring compliance, makes highly suitable to contribute to your company's mission of driving innovation in the products. Eager to leverage skills and experience to enhance infrastructure reliability, data security, and operational efficiency for your company's high-performance computing environment.

Mobile: 983-***-**** E-mail: ******.*******@**********.*** Address: Pune, IN DOB: 16-09-1985 LinkedIn: https://www.linkedin.com/in/tushar-pathare-564540162 Research Gate: https://www.researchgate.net/profile/Tushar-Pathare Science Direct: https://www.sciencedirect.com/science/article/abs/pii/S1877750315300223 PROFESSIONAL SUMMARY

Designed and implemented scalable HPC architectures and data storage solutions using IBM Spectrum Scale for large-scale genomic data processing and analysis.

Led end-to-end IT infrastructure operations, ensuring seamless integration and high performance for a major European client's production environment.

Developed and implemented security frameworks for managing sensitive data, ensuring compliance with ISO 27001 standards. Possess extensive experience in performance testing and optimization of large-scale file systems, including IBM Spectrum Scale (GPFS). Successfully collaborated with development teams to enhance product capabilities and ensure compatibility with various platforms, including cloud environments and Hadoop ecosystems.

Performance tuning of GPFS/Spectrum scale cluster for maximum performance of healthcare applications Infiniband management using verbsRDMA for GPFS.

Job Scheduling on IBM LSF on-prem as well as of-prem. TECHNICAL SKILLS

IBM Spectrum Scale HPC Cluster Management Data Security Cloud Computing Linux Kernel Virtualization Hadoop Ecosystem Data Warehousing ETL Cybersecurity Amazon Web Services WORK EXPERIENCE

Solution Architect

Trianz Digital Consultancy Pvt Ltd (Pune) (Jan 2025 - May 2025) As a Solution Architect at Trianz, I design and architect cutting-edge cloud solutions for high-performance workloads, primarily for clients in the oil and gas sector and stock exchanges. I work closely with these clients to understand their unique business challenges, creating tailored solutions that optimize their infrastructure. By designing scalable, efficient, and secure cloud architectures on AWS, I help clients achieve superior performance, enhance innovation, and drive cost efficiency. My deep technical expertise and industry insights empower clients to meet their demanding computational needs and maintain a competitive edge. I’ve architected solutions that integrate with AWS ParallelCluster for managing clusters, AWS Batch for efficient job scheduling, and leveraged Amazon SageMaker for AI/ML workloads in oil and gas exploration and stock market prediction models. Additionally, I’ve implemented fault- tolerant, highly available architectures with multi-region deployment to ensure business continuity, while optimizing cost by utilizing AWS Reserved Instances, Spot Instances, and Auto Scaling. In the stock exchange domain, I’ve designed low-latency, high-throughput systems to handle real-time market data processing, and for the oil and gas industry, I’ve delivered solutions that support seismic data processing and reservoir simulation at scale. My role also involves ensuring security and compliance with industry regulations, implementing encryption, and identity and access management (IAM) best practices to protect sensitive data.

Solution Architect IBM Spectrum Scale and Solution Architect for a Healthcare tech Eviox Tech Pvt Ltd (Dec 2020 - Dec 2024)

Responsibilities included providing technical guidance, conducting solution assessments, developing implementation plans, and collaborating with clients to meet their specific business objectives. Delivered consultancy services for IBM Spectrum Scale, a high-performance clustered file system, to optimize data storage and management for demanding workloads.

Designed and architected healthcare technology solutions leveraging expertise in HPC, big data analytics, and cloud computing to address industry-specific challenges

Performed Storage management using IBM Spectrum Scale (GPFS) and Cluster management(xcat,PCM,BCM) activities on day to day basis.

Scheduled IBM LSF Jobs for Researchers and Scientist for Healthcare Tech,Telecom tech,etc Deploy DDN Gridscale cluster and perform cluster management activities as well Technology Catalyst(HPC Administrator)

Sidra Medicine (Jun 2014 - Dec 2020)

Responsibilities included managing and optimizing HPC infrastructure, collaborating with researchers and IT specialists, and contributing to the successful execution of the Qatar Genome Project.

Perform Storage management(GPFS) and Cluster management(xcat,PCM,BCM) activities on day to day basis. Scheduled LSF Jobs for Researchers and Scientist for Biomedical Research. Led the design and implementation of scalable HPC architectures and data storage solutions to support genomic research and analysis initiatives.

Engineered a dynamic query engine to integrate heterogeneous data sources, enabling seamless access to genomic, phenotypic, and clinical data.

Developed and implemented a standardized genomic data processing pipeline, including alignment, variant calling, and annotation tasks, using industry-standard tools and technologies.

Established secure and compliant data management practices, adhering to ISO 27001 standards, to safeguard sensitive genomic and patient information.

Leveraged machine learning and data analytics techniques to enhance network security, perform image recognition in genomic research, and optimize HPC operations.

Achievements:

Successfully integrated genomic and clinical platforms, enabling efficient data stewardship and analytics for improved research outcomes. Implemented Apache NiFi to facilitate real-time data ingestion and transformation for enhanced analytical workflows. Contributed towards developing and implementing an information security framework for the research division, ensuring compliance with ISO 27001 standards.

Team Lead

Tata Consultancy Services (Mar 2014 - Jun 2014)

Responsibilities included managing project timelines, allocating resources, mitigating risks, and serving as the primary liaison between the client and technical teams.

Led a team of engineers in delivering end-to-end IT infrastructure operations for Barry Callebaut, a leading global chocolate manufacturer. Oversaw the design, deployment, and management of scalable IT solutions to support the client's production and operational needs. Achievements:

Ensured seamless integration and deployment of the entire production infrastructure, meeting stringent performance and availability requirements.

Successfully minimized system downtime and optimized reliability to support the client's global operations. IBM Software Engineer

IBM (Jul 2007 - Mar 2014)

Responsibilities included designing test plans, developing test cases, automating test workflows, analyzing performance bottlenecks, and collaborating with development teams to resolve issues and enhance product quality. Conducted rigorous performance evaluations of IBM Spectrum Scale (GPFS) under diverse workloads, ensuring high throughput and low latency in clustered environments.

Designed and executed test cases to validate new features for IBM Spectrum Scale, including scale-out NAS (SONAS) and WAN caching solutions, ensuring compatibility with various Linux distributions and virtualization platforms. Collaborated with development teams to enhance product capabilities, contributing to the product's successful integration with Hadoop ecosystems like Apache Drill, Spark, and Hive for seamless big data workflows. Provided technical expertise and presentations to customers, highlighting IBM Spectrum Scale's capabilities and ensuring alignment with their specific requirements.

Achievements:

Contributed to the development of test automation scripts and tools to streamline testing processes and analyze system behavior under extreme conditions.

Played a key role in improving the robustness and adaptability of IBM Spectrum Scale, leading to its widespread adoption in enterprise, HPC, and data-intensive environments.

EDUCATION

Bachelor's degree, Information Technology (2004 - 2007) Pune Institute of Computer Technology

Master of Business Administration - MBA (IT & Strategic Innovation Masters), Information Technology (2015 - 2017) Kingston University

Higher Secondary School (Jun 2001 - Jun 2003)

At Vincent’s High School and Junior college, Camp Pune CERTIFICATIONS

AWS Certified Solution Architect - Associate (AWS - 2025) COMPTIA Linux+ (COMPTIA - 2025)

HashiCorp Certified:Terraform Associate (Hashicorp - 2025) ISACA CXSF (ISACA - 2019)

ACHIEVEMENTS

Technical Paper Release in science direct journal

PROJECTS

Qatar Genome Project

Managed the integration of genomic data across high-performance computing (HPC) clusters, leveraging IBM Spectrum Scale to establish a robust and scalable infrastructure for the Qatar Genome Project (QGP). Designed and deployed HPC architectures optimized for large-scale genomic data processing and analysis, ensuring high throughput, low latency, and seamless data access across multiple compute nodes. Built and implemented a standardized genomic data processing pipeline, encompassing alignment, variant calling, and annotation tasks, to ensure efficient and consistent data analysis.

Developed and implemented an information security framework aligned with ISO 27001 standards to safeguard sensitive genomic data, incorporating access control and data encryption mechanisms to meet regulatory and privacy requirements. Engineered a dynamic query engine to integrate heterogeneous data sources, including genomic, phenotypic, and clinical data, creating a unified platform for researchers.

Utilized advanced analytics and machine learning tools like TensorFlow and Elastic Stack (ELK) to enhance network security, perform image recognition in genomic research, and optimize HPC operations. Led cross-functional teams of bioinformaticians, researchers, and IT specialists to align technical solutions with scientific goals and ensure project success.

Team leader infrastructure

Led a team of engineers at Tata Consultancy Services to deliver end-to-end IT infrastructure operations for Barry Callebaut, the world's largest chocolate producer.

Managed project timelines, resource allocation, and critical milestones, ensuring adherence to quality standards and client expectations throughout the project lifecycle.

Successfully deployed scalable, high-performance IT solutions tailored to Barry Callebaut's production and operational needs, resulting in a robust and reliable IT infrastructure supporting global operations. Acted as a primary technical liaison between client stakeholders and technical teams, effectively translating business needs into actionable technical solutions and ensuring project alignment with client objectives. Mentored team members to enhance technical proficiency and fostered a collaborative, problem-solving-oriented environment, resulting in a high-performing team capable of managing complex infrastructure projects independently. IBM Spectrum Scale (QA and Dev)

Specialized in testing and enhancing IBM Spectrum Scale (GPFS), focusing on performance optimization, feature validation, and seamless integration with various platforms and technologies. Conducted rigorous performance evaluations under diverse workloads, ensuring the file system met stringent high-throughput and low- latency requirements for enterprise deployments.

Designed and executed test cases to validate new features, including integration with scale-out NAS (SONAS), WAN caching solutions, virtualization platforms (KVM, QEMU), and cloud environments (IBM Bluemix, OpenStack). Collaborated with development teams to enhance product capabilities and ensure compatibility with Linux distributions like SLES and RHEL, focusing on file system consistency during upgrades and migrations. Developed custom scripts and tools to automate test workflows and analyze system behavior under extreme conditions, including data- intensive applications and high-concurrency environments. Contributed to the successful integration of IBM Spectrum Scale with Hadoop ecosystems like Apache Drill, Spark, and Hive, ensuring seamless functionality in big data workflows.

SKILLS

Core Competencies: High-Performance Computing (HPC), Data Management & Storage, Cybersecurity & Compliance, Solution Architecture

& Design, Team Leadership & Project Management, GPFS/Spectrum Scale, Cloud Computing Soft Skills: Communication, Problem Solving, Leadership, Teamwork, Analytical Skills, Adaptability, Time Management, Creativity, Mentorship, Negotiation

HOBBIES

Technology, Genomics, Research, Mentorship, Writing LANGUAGES

English, Marathi, Hindi

#CreatedByOutspark#



Contact this candidate