Data Engineer Analysis

Location:
San Bernardino, CA
Posted:
September 20, 2023

Resume:

AKSHARA PRIYA PEDDI

Data Engineer, Accenture. California State University San Bernardino

adzuc3@r.postjobfree.com 240-***-**** https://www.linkedin.com/in/aksharapriyapeddi

TECHNICAL SKILLS

• Programming Languages: Python, Java, C, HTML, SQL, R, Embedded C

• Cloud: Cloud Platforms (AWS, Azure); Docker; Kubernetes; Cloud Infrastructure Databases (MySQL, HBase); Apache Spark; Hadoop; MapReduce; Hive; Snowflake; Pig; SaaS

• Networking: TCP/IP Stack; OSI Model; Routing and Switching (EIGRP, BGP, OSPF, VLANs, MPLS, Multicast protocols); Data Center Networking; IPv4 and IPv6.

• Security: Blockchain development; Network Systems Security; Cloud Security.

• Tools and OS: Tableau, Microsoft Power BI, BigQuery, DOMO, MongoDB, Linux.

• Certifications: Agile Management; Power BI; Data Analytics

WORK EXPERIENCE

Inland Empire Health Plan Rancho Cucamonga, CA

Quality Engineer Intern July ’23 - Present

• Develop and design reports and analyses using Microsoft applications (MS Access, SSRS) and other reporting software (SAS, SPSS, Tableau, Azure DevOps).

• Generate and manage updated case studies for reporting and continuous learning purposes.

• Developed multiple POCs using Spark and Scala, deployed them on the YARN cluster, and compared the performance of Spark with Hive and SQL.

• Use Amazon Elastic Compute Cloud (EC2) infrastructure for computational tasks and Simple Storage Service (S3) as the storage mechanism.

• Use AWS utilities such as EMR, S3, and CloudWatch to run and monitor Hadoop and Spark jobs on AWS.

• Gather and analyze data, narratives, and gap analyses to provide insights for transformation projects.

California State University, San Bernardino San Bernardino, CA

Student Assistant – Career Centre Aug ’21 - Present

• Assist the career center with data-related duties, such as organizing and analyzing data on career paths, job markets, and salary ranges to provide students with accurate and up-to-date information.

• Create and maintain databases that track student job placement and career outcomes, and collect and analyze feedback from students and employers.

• Developed strong analytical and organizational skills, as well as proficiency with data analysis software such as Excel or SPSS.

• Create reports and presentations based on the collected data, which requires strong communication and presentation skills.

Accenture Hyderabad, India

Data Engineer Jun ’19 - Aug ’21

• Designed and implemented ETL processes using Apache Spark and Hive to process and transform data from various sources.

• Developed and maintained data pipelines using Python and Scala to automate data integration and processing tasks.

• Designed visualizations and performed data analysis, which resulted in process improvement.

• Automated Sqoop jobs to extract data from different data sources such as PL/SQL and push the result sets to the Hadoop Distributed File System.

• Developed a MapReduce pipeline for feature extraction using Hive.

• Developed and maintained data quality and data governance processes to ensure the accuracy and reliability of data.

• Developed a prototype that automates the CI/CD pipeline, reducing the engineering and testing workload by up to 40%.

Infotech Inc Hyderabad, India

Data Engineer Intern Jan ’19 - May ’19

• Delivered clerical support by handling a range of routine and special requirements.

• Maximized productivity by analyzing protocols and identifying areas for improvement. Contributed to content creation for the company website.

• Achieved management recognition by designing and implementing special projects during an internship.

• Performed data cleaning, feature scaling, and feature engineering using the pandas and NumPy packages in Python. Experience developing pipelines in GCP and Azure.

EDUCATION

California State University San Bernardino

Master of Science in Information & Technology - 3.8 / 4.0

Aug ’21 - Aug ’23 San Bernardino, CA

Bhoj Reddy Engineering College for Women

Bachelor of Technology ECE - 3.64 / 4.0

Aug ’15 - May ’19 Hyderabad, India

PROJECTS

Ron’s Parking Services

Programming Language: Web Dev, Python Flask & MySQL Apr ’23

Team-based project that improved parking and traffic management for CSU, San Bernardino by 30%, which allowed our team to win the AWS Hackathon and receive commendation from the college. Used MySQL for the database, Flask for the backend, and Bootstrap for the front end, with assistance from Lambda and AWS web hosting services.

Data Connect for Equipment

Programming Language: PostgreSQL, DBeaver Jan ’23

Created a tool to read data from a sensor module and store it in the block memory of an FPGA. The processor also pulls information from the block memory to show as an output waveform. The tool was implemented in VHDL, allowing the processor to read and analyze the data as well as collect and store it, resulting in a 45% gain in efficiency.

House Price Prediction (Kaggle Project)

Programming Language: R Nov ’22

Created 80 potential predictors for multiple regression models, imputed missing data, visualized relationships using effective predictors, and predicted final selling prices of houses.

Smog Data Analysis

Programming Language: Xilinx ISE, Microsoft Office Suite Oct ’22

Collected information from the US Environmental Protection Agency website. Refined the major mathematical metrics used to generate the Air Quality Index by carefully examining the data to gain insights into them. Developed SQL queries to yield business insights.

Distributed File System identical to HDFS

Programming Language: Python Jun ’22

Implemented an HDFS-like distributed file system consisting of five storage servers, which store the files, and one naming server. The file system supports file reading, writing, creation, and deletion, with file locking and replication.

RAFT Consensus Algorithm

Programming Language: Java Feb ’22

Designed and developed a distributed system and implemented a RAFT replicated service on top of it. The resulting system allows the service to continue operating consistently even if some of its servers experience frequent failures.

Multi-Threaded Network Proxy Server with Web Object Caching

Programming Language: C++ Nov ’21

Designed and developed a concurrent, multi-threaded proxy server with a readers/writers lock. The proxy server also has an LRU cache that stores local copies of web objects and responds to later requests directly.


