Machine Learning Data Engineer

Location:
United States
Posted:
November 05, 2024

Resume:

MOUNICA DASARI

***************@*****.*** +1-989-***-****

SUMMARY

• Over 3 years of professional experience as a Data Engineer and Data Analyst.

• Proficient in data preprocessing, including data cleaning, correlation analysis, imputation, visualization, feature scaling, and dimensionality reduction, using Python data science packages (Scikit-Learn, Pandas, NumPy).

• Working experience in Natural Language Processing (NLP) and a deep understanding of statistics, linear algebra, calculus, and optimization algorithms such as gradient descent.

• Experience working with NoSQL databases such as Cassandra and HBase; developed real-time read/write access to very large datasets via HBase.

• Proven expertise in cloud platforms (e.g., Azure, AWS, and GCP) and big data technologies (e.g., Spark, Hadoop, Kafka).

SKILLS

• Hadoop/Big Data Technologies: HDFS, Hive, Pig, Sqoop, YARN, Spark, Kafka, Spark SQL

• Machine Learning: Linear Regression, Logistic Regression, Naive Bayes, Decision Trees, Random Forest, Support Vector Machines (SVM), K-Means Clustering, K-Nearest Neighbors (KNN), Gradient Boosting Trees, AdaBoost, PCA, LDA, Natural Language Processing

• Languages: C, C++, Python, Scala, UNIX Shell Script, COBOL, SQL and PL/SQL

• Tools: Teradata SQL Assistant, PyCharm, AutoSys

• Operating Systems: Linux, Unix, OS and Windows

• Databases: Teradata, Oracle 9i/10g, DB2, SQL Server, MySQL 4.x/5.x

• Other tools and technologies: TensorFlow, Keras, AWS ML, Azure ML Studio, GCP, NLTK, spaCy, Gensim, MS Office Suite, Google Analytics, GitHub, AWS (EC2/S3/Redshift/EMR/Lambda/Snowflake)

• IDEs: PyCharm, Jupyter Notebook, Spyder

• Tools: Git, GitHub, JIRA

• Additional Expertise: Snowflake, Microsoft Power BI, SQL Server Integration Services (SSIS), Agile Methodologies, Data Security and Compliance, Customer Satisfaction

• ETL Tools: IBM InfoSphere Information Server V8, V8.5 & V9.1

• Reporting: Tableau, Power BI

PROFESSIONAL ACCOMPLISHMENTS

Hartford Financial Services Group, Data Engineer Jan 2024 - Current

• Worked on an end-to-end machine learning workflow: wrote Python code to gather data from Snowflake on AWS, followed by data preprocessing, feature extraction, feature engineering, modeling, model evaluation, and deployment. Wrote Python code for exploratory data analysis using Scikit-learn, NumPy, Pandas, Matplotlib, Seaborn, statsmodels, and pandas-profiling.

• Used the Spark SQL API in PySpark to extract and load data and to run SQL queries.

• Responsible for the design, development, and testing of the database; developed stored procedures, views, and triggers. Developed a Python-based API (RESTful web service) to track revenue and perform revenue analysis.

• Implemented AWS Lambda functions and a Python script that pull privacy files from AWS S3 buckets and post them to the Malibu data privacy endpoints.

Coforge, Data Engineer Aug 2019 - Jul 2022

• Worked with the Python packages NumPy, SciPy, Pandas, Matplotlib, and stats packages to perform dataset manipulation, data mapping, data cleansing, and feature engineering. Built and analyzed datasets using R and Python.

• Responsible for the design, development, and testing of the database; developed stored procedures, views, and triggers. Developed a Python-based API (RESTful web service) to track revenue and perform revenue analysis.

• Spearheaded the development initiative for seamless data integration processes between on-premises systems and the

• Leveraged cutting-edge integration analytics tools to architect a modern data warehouse characterized by accuracy, and

• Advised on migrating data from Teradata to AWS using Python and BI tools like Alteryx.

• Automated the data flow from data sources (flat files, Postgres database) to an S3 bucket using Python, SQL, and Alteryx. Also provided data files for Tableau reporting.

EDUCATION

Central Michigan University, Mount Pleasant, MI

Master of Science, Computer Science

Mahatma Gandhi Institute of Technology, Hyderabad, India

Bachelor of Technology, Computer Science

CERTIFICATIONS

• Certified by NPTEL in The Joy of Computing using Python.

• Certified by Techgyan in the Ethical Hacking Workshop conducted at IITH.

• Certified in Programming for Everybody (Getting Started with Python) through Coursera.


