Akshay B
Coppell, Texas, United States
********@*****.*** 817-***-****
linkedin.com/in/akshay-b-08345b1a9
Summary
Data Engineering Project Lead with a demonstrated history of working in the information technology and service industry. Extensive experience in the Financial, Insurance, Telecom and HR domains. Skilled in Spark(Core/SQL/ Streaming/Machine Learning), Scala, Python, Snowflake, DBT, Netezza, DataStage, Kafka, Hive, Sqoop, HBase, Apache Phoenix, Hadoop, Software Development Life Cycle (SDLC), Requirements Analysis, Application Support, and Mainframe. Strong program and project management professional with a Master's degree from UNT and Bachelor's Degree from University of Rajasthan.
Total of 16+ years of overall IT Experience with 84 months of Data Engineering, 36 months in Leadership (Project and Service Management), 36 months of Business Analyst in Property & Casualty Insurance, and rest of IT experience as a Software Developer in Mainframes.
• Working on lead activities involving migration project from Netezza to Snowflake using lift-n-shift approach
• Worked on building an Enterprise Data Lake with data from various sources like Oracle server, SQL Server, Postgres and DB2. cleansing and transforming the data. Involved in migration to AWS.
• Involved in migrating in-house data lake to AWS cloud using services Glue, Athena, S3, Kinesis, Lambda, Step functions, EMR, DynamoDB, EC2
• Transformed the data using AWS Glue dynamic frames with PySpark; cataloged the transformed the data using Crawlers and scheduled the job and crawler using workflow feature
• Kinesis firehose to consume from KAFKA and write to S3, Athena to query the data Experience
Data Engineer/Analyst
USAA
Oct 2020 - Present (2 years 6 months)
Working on building Data pipelines to migrate data from Netezza to Snowflake, the data ingested from multiple sources and placed on Snowflake using multiple technologies
• Recently involved in snowflake migration from IBM Netezza
• Data received from Workday, SumTotal, SnapLogic, from other external sources through API and real time data through KAFKA
• Landing zone of the source files is AWS S3 to Hadoop servers using in-house framework MFP and files are eventually loaded over Snowflake
• ELT framework is used where Transformation is carried out through DBT and processed data is placed over Snowflake
• Control-M is used to design and schedule the workflows
• CI/CD pipeline are run through Git-lab and UCD is used to deploy code in various environments.
• Utilize agile development through Kanban board to complete user stories. Sr. Big Data Engineer
Capital Group
Dec 2018 - Sep 2020 (1 year 10 months)
Akshay B - page 1
Worked on building an Enterprise Data Lake with data from various sources like Oracle server, SQL Server, Postgres and DB2. cleansing and transforming the data. Involved in migration to AWS.
• Involved in migrating lake to AWS cloud using services such as Glue, Athena, S3, Kinesis, Redshift, EMR as an ongoing activity
• Transformed the data using AWS Glue dynamic frames with PySpark; cataloged the transformed the data using Crawlers and scheduled the job and crawler using workflow feature
• Kinesis firehose to consume from KAFKA and write to S3, Athena to query the data
• Involved in data ingestion from various sources using Sqoop and Kafka for near real time data.
• Used different file formats like ORC, Parquet according to the requirement.
• Involved in building Spark application to cleanse and transform the data using Scala
• Experienced in handling large datasets using Partitions, Spark in Memory capabilities.
• Utilize agile development delivering tangible product by end of each sprint. Big Data Developer
DXC Technology
Feb 2017 - Nov 2018 (1 year 10 months)
• Involved in creating Data lake by importing tables/data from Netezza to HDFS using Sqoop.
• Rewritten queries from Netezza to Hive and optimized them for better performance
• Used Spark API over Hadoop YARN to perform analytics on data in Hive
• Created Hive External tables with partitioning/Bucketing to store the processed data.
• Involved in building a POC using Spark ML and GraphX to automate Regression Test bed
• Performed text analytics and processing, using the in-memory computing capabilities of Spark Big Data Developer
DXC Technology
Jun 2015 - Jan 2017 (1 year 8 months)
• Involved in building a POC using Spark ML and GraphX to automate Regression Test bed
• Performed text analytics and processing, using the in-memory computing capabilities of Spark
• Involved in creating Data lake by importing tables/data from Netezza to HDFS using Sqoop.
• Rewritten queries from Netezza to Hive and optimized them for better performance
• Used Spark API over Hadoop YARN to perform analytics on data in Hive
• Created Hive External tables with partitioning/Bucketing to store the processed data. Senior Software Engineer
DXC Technology
Apr 2009 - Jul 2013 (4 years 4 months)
• Mentored the team and provide technical guidance.
• Involved in the Impact analysis and requirement gathering phase
• Involved in SDLC activities Analysis, Design, testing, Implementation and production support
• Providing the estimates in the Application in the development team
• Responsible for maintaining the Quality Standards in all the development and enhancing activities performed, from the stage of Analysis, preparing design document till documenting Software Engineer
DXC Technology
Jun 2007 - Mar 2009 (1 year 10 months)
Akshay B - page 2
• Involved in SDLC activities Analysis, Design, testing, Implementation and production support
• Responsible for maintaining the Quality Standards in all the development, enhancing activities performed, from the stage of Analysis, preparing design documents and mentoring new resources Software Engineer
Datrata Infotech Private Limited
Dec 2004 - May 2007 (2 years 6 months)
• Involved in SDLC activities Analysis, Design, testing, Implementation and production support
• Responsible for maintaining the Quality Standards in all the development and enhancing activities performed, from the stage of Analysis, preparing design documents Education
UNT College of Engineering
Master's degree, Computer Science
2013 - 2015
University of Rajasthan
Bachelor's degree, Electrical, Electronics and Communications Engineering 2000 - 2004
Skills
Computing • Project Management • Python (Programming Language) • Hive • Application Programming
Akshay B - page 3