Post Job Free

Resume

Sign in

Data Engineer

Location:
Bellevue, WA
Posted:
July 02, 2020

Contact this candidate

Resume:

Babu

Contact: 339-***-****

C E R T I F I E D

Professional

Email:add9tl@r.postjobfree.com

Over 6+ years of experience in which 4 years of experience in DSE Cassandra & Apache Cassandra. Worked as NoSQL Data Engineer and has solid experience in Spark, Kafka, Python, Mango DB & SQL Server Databases in design, development and administration.

Experience Summary:

Strong Experience in Supply Chain Domain.

Deployed Datastax enterprise on development, testing and production servers.

Expertise in installing, upgrading apache Cassandra and Datastax enterprise both in simple strategy and Network topology strategy in single datacenter and multi datacenter environment.

Experience in installation and configuration of Datastax enterprise components like OpsCenter and DevCenter.

Good experience analyzing cassandra logs from splunk and in creating alerts in splunk.

Performed compaction, repair and tuning regularly to enhance performance.

Troubleshooted the cluster with logs and found out the root cause of issue and improved the system performance.

Strong experience in installing upgrading couchbase in all environments.

Experience in Data Modeling and working with Cassandra Query Language(CQL).

Involved in the process of data modeling Cassandra Schema and Created highly efficient data models in CQL for customer data.

Experience in designing API’s, consumers, publishers to post and consume events to external systems using message broker.

Used CQL in Java spring boot API to retrieve and write data for Cassandra tables.

Involved in developing Scala jobs to extract data from production using data frames.

Hands on experience in managing multiple databases.

Good knowledge on Datastax Search/ Solr in indexing and managing searches.

Good Experience in installing & Monitoring MongoDB.

Performed health check on monitoring infrastructure an enabled auditing & alerts.

Hand’s on experience in building DSE & Apache Cassandra clusters from scratch on perm & cloud.

Experience in bootstrapping, decommissioning, removing, replacing, and repairing nodes.

Utilized Cassandra tools including sstableloader,sstabledump, and scripts using COPY command for application data loading and analysis.

Strong experience developing scripts in shell & python for monitoring health of cluster(house Keeping Scripts).

Actively monitor existing databases to identify performance issues, and give application teams guidance and oversight to remediate performance issues.

Experience in data backup, recovery and scheduling repair service in Cassandra clusters.

Use Cassandra-stress to load tests, baselining and capacity planning clusters for new application/projects.

Configured backup, alerts, repairs using custom scripts.

Knowledge on Developing Splunk queries and dashboards targeted at understanding application performance and capacity analysis.

Configured Grafana dashboard and Prometheus server and pushed metrics from ALL PROD clusters using JMX exporter.

Good experience in installation, upgrade and configuration of Microsoft SQL Server and databases in clustered and non-clustered environments

Technical Skills: -

Cassandra DataStax Enterprise Cassandra 4.7,4.6,5.0,5.1.11,5.1.16 Apache Cassandra, Solr,Spark,Hadoop, Choesity, Splunk, Grafana, Data Dog

SQL Server Tools SSMS, SQL Server Profiler, SSIS, Query Analyzer, Performance Monitor

Languages CQL, T-SQL, C, C++, HTML, XML, Shell Scripting, Java Spring boot, Python.

Databases MS SQL.MySQL, Oracle 10g,11 g, PostgresSQL

NoSQL Databases Cassandra, HBase,MomgoDB

Operating Systems Windows, Linux, Red hat, Centos

Other Tools: Service Now,Spark 2.0,Solr, Kafka, GemFire, Rabit MQ, Pivotal Cloud Foundry, AWS,GCP.

Professional experience

T-Mobile, Bellevue, WA Jan’19 – Till Date

NoSQL DBA

Responsibilities:

Worked o dse 5.1.x versions to install,configure,deploy on premise infrastructure.

Good experience in managing Cassandra & Mongo DB in Linux environment.

Wrote shell scripts and assigned them in cron tab of Linux for automation of tasks like repairs and compaction.

Worked close with Datastax support team by rising tickets and on call support.

Involved in Data Modeling meetings with the business users and analyzing the requirement, designing tables based on the query’s fired by application users.

Responsible for working with development teams in evaluating new database technologies and provide guidance to development teams.

Involved in reviewing data models of individual applications and performing stress test at design level.

Strong experience in creating solr indexes on columns & tuning schema.xml files.

Performed Backups from production and applied RDD Transformations using spark and loaded in NPE.

Experience in designing & implementing Cache layer Gemfire(In memory database) for few capabilities.

Involved in developing JAVA API’s for validation calls from external systems.

Experience in creating notebooks using Apache Zeppelin on top of Cassandra for data visualization and sharing those notebooks to business user.

Good experience in analyzing db logs from Splunk and creating dashboards.

Extensively involved in meeting with Datastax & TLP (The Last Pickle),InstaClustr Teams to analyze cluster’s and data model walkthrough.

Hands on Experience in building DSE clusters in PROD and NPE.

Extensively used Nodetool for the cluster administration of Cassandra.

Good experience in building Opscenter and installing agents on all the PROD clusters and adding configuring alerts and graphs.

Involved in building Apache Cassandra cluster from scratch and migrating tables from DSE Cassandra to open source Cassandra.

Analyzed the thread pool stats, column family stats and column family histograms in order to find the performance bottlenecks and read, write latencies of particular key spaces and tables.

Involved in monitoring production cluster using OpsCenter & graphana tools.

Good experience on python & Spring boot API’s.

Hands on experience with oracle PL/SQL.

Environment: DSE Cassandra 5.0.5,5.1.7,5.1.11, Red Hat Linux, Spark, Sqoop, Service Now, Solr, kafka, Hadoop, AWS, Elastic Search, Kibana, JIRA.

Tdameritrade, Columbia, MD May’17 – Nov’18

NoSQL DBA

Responsibilities:

Experience on Designing, Planning, Administration, Installation, Configuring, Troubleshooting, performance monitoring and Fine-tuning of Cassandra Datastax Enterprise versions on cluster.

Worked close with infrastructure team for provision new clusters with required properties.

Strong experience in analyzing the logs(trouble shooting skills) and identifying the RCA.

Performed security reviews of critical databases across organization to ensure databases are aligned with client policies.

Used core java concepts like Collections, Generics, Exception handling, IO, Concurrency to develop business logic.

Creating required keyspaces for applications in prod, dev, test, and fst clusters.

Determining and setting up the required replication factors for keyspaces in prod, dev etc. environments in consultations with application teams.

Involved in Data Modeling applying like applying mapping rules, mapping patterns.

Creating required tables with appropriate privileges to the users and secondary indexes.

Followed benchmarking standards on setting Cassandra configuration for high throughput and productive write-heavy applications.

Involved in upgrading and performing patches on couchbase.

Ran many performance tests using the Cassandra-stress tool for tuning data model and to improve the read and write performance of the cluster.

Used Snapshot & incremental backups to take backup and restore on another node.

Bulk-loaded the data into Cassandra using sstableloader.

Implemented Solr Cluster and access data from solr.

Good Experience in using sqoop to load data to and from Cassandra cluster.

Experience in fetching and loading data to and from Cassandra cluster using Spark – Cassandra connector.

Analysis of database access patterns to isolate hotspots, data model problems, and other bottlenecks.

Worked on tuning Bloom filters and configured compaction strategy based on the use case.

Hands on experience in upgrading Cassandra versions and performing rolling restart.

Setting up Opscenter and datastax agent for monitoring cluster and enable services and alerts, rebalancing cluster etc.

Involved in writing shell scripts to monitor Cassandra cluster and generating alerts by placing them in corn tab.

Environment: DSE Cassandra 4.7,4.8, Cqlsh, Red Hat Linux, Spark, Sqoop, Service Now,Solr,kafka,Hadoop.

American Airlines, Dallas, Tx Jan’16-April ‘17

Cassandra/SQL DBA

Responsibilities:

Cassandra DBA/Developer:-

Installed, configured and deployed Datastax Enterprise Cassandra 4.6.10 and 4.7.5 with single Datacenter on multi node cluster with V-nodes.

Implemented commissioning and decommissioning of data nodes.

Involved in the process of Cassandra data modelling and building efficient data structures.

Consistency levels for read & write quries were implemented depending on the use case.

Used Data Modeling best practices like Partition per Query strategy for good performance of the Cassandra cluster, De-normalizing data for better read performance.

Involved in working on Cassandra database to analyze how the data get stored.

Good Experince in creating physical data modeling and converting them into chebatko Daigrams.

Tuned the Cassandra.yaml and Cassandra-env.sh file to enhance and improve the performance.

Worked on migration of data from Oracle DB to Cassandra using spark basing on the requirements.

Imported data into Cassandra using pyspark,scala to process the data.

Experience in working with Solr in Cassandra cluster.

Added/Bootstrapped, Removed and replaced the nodes in the cluster using the Nodetool.

Extensively used Nodetool for the cluster administration of Cassandra.

Performed backup using Snapshot commands and restored them using nodetool refresh commands.

Used sstable2json & sstabledump tools to read the data(.db) files in Cassandra.

Analyzed the performance of Cassandra cluster using TP stats and CFstats for thread analysis and latency analysis.

Good Knowledge on installing and configuring Hadoop cluster.

Applied patches for particular requirements of the cluster.

Wrote shell scripts and assigned them in Cron tab of Linux for automation of tasks like repairs and compaction.

SQL DBA:-

Installed and configured SQL Server 2014/2012/2008R2 Enterprise edition on Active/Passive Cluster and standalone environment.

Implemented SQL Server 2012 new features like AlwaysOn Availability Groups for high availability of multiple databases in place of database mirroring.

Successfully migrated from SQL Server 2008 to 2012 and also to 2014 on Development/Testing/production Environment.

Implemented Resource Governor for one of the critical server for better performance.

Applied Page and row compression to efficiently organize the space.

Monitored database system details within the database, including stored procedures and execution time and implement efficiency improvements.

Development of automated daily, weekly and monthly system maintenance tasks such as database backup, Mirroring, database integrity verification, indexing and statistics updates.

Database and SQL Query Tuning to improve performance of loads.

Reduced the dead locks by using the SQL Server Profiler hence the performance of the query is tuned.

Created automatically running stored procedures for day-end operation using SQL Server agent

Implementing different Development and Test Server instances for the Application Development Team and to co-ordinate Development and Testing Environments and keep them updated.

Daily support, troubleshooting, monitoring, optimization and tuning of server and SQL server environments across entire system.

Successfully implemented database mirroring between Primary server and Mirror Server.

Implemented log shipping to support DR servers, worked on high availability concepts like always on and clustering.

Provided 24 X 7 dedicated supports for SQL Server production server.

Environment: MS SQL Server 2012/2008R2/2005/2000, SQL Server Management Studio, SQL Profiler, Apache & DSE Cassandra,Opscenter,Devcenter,Centos 6.5,Spark,Solr,Hadoop.

Inspiredge IT Solutions April’14 – Nov ’14

SQL Server DBA

Responsibilities:

Installed and configured SQL Server 2008 Enterprise edition on Active/Passive Cluster environment.

Successfully migrated from SQL Server 2005/2008R2 on Development and Testing Environment.

Pre installation check before installing SQL Server 2008 that identified unsupported configurations.

Created SSIS packages for the ETL the data from SQL Server 2005, flat files, Excel files.

Implemented Resource Governor for one of the critical server for better performance.

Applied Page and row compression to efficiently organize the space.

Migrated new database structures, such as tables, indexes and stored procedures from development to the production environment.

Monitored database system details within the database, including stored procedures and execution time and implement efficiency improvements.

Development of automated daily, weekly and monthly system maintenance tasks such as database backup, replication verification, Mirroring, database integrity verification, indexing and statistics updates.

Implemented the Data Collector to capture the CPU, DISK, Memory counters.

Implement table partitioning to improve performance and data management.

Designed and deployed SQL Server Integration Services (SSIS) Packages, with error handling, environmental and SQL server package configurations.

Incremental loading of dimensions, facts, error handling in SSIS.

Database and SQL Tuning to improve performance of loads.

Reduced the dead locks by using the sql server profiler hence the performance of the query is tuned.

Created automatically running stored procedures for day-end operation using SQL Server agent

Creating logins and roles with the appropriate permissions.

Successfully implemented database mirroring between Primary server and Mirror Server.

Implemented log shipping in standby mode to support DR servers.

Provided 24 X 7 dedicated supports for SQL Server production server.

Environment: Microsoft Windows Server 2008 Enterprise Edition, Microsoft Windows 2003 Server, Microsoft SQL Server 2005, 2008, Microsoft SQL Server Integration Services, Microsoft SQL Server Reporting Services, Microsoft OFFICE 2007.



Contact this candidate