Jose P.
Data Scientist
Skills
A Data Scientist and Engineer skilled in mathematics, statistics, and computer science with several years of experience in data analysis projects. Responsible for leading the entire process of Data Mining, Modeling, and Business Analytics/Science, contributing significantly to the improvement of analyses and the development of efficient methods to ensure high-quality results. Possesses experience in data engineering, including acquiring information and implementing data ingestion pipelines to support Business Analytics/Science projects.
Senior Data Scientist
5/1/2023 - Present
Developed expertise in data extraction, transformation, and ingestion (ETL) processes while working as a Data Scientist/Engineer in the banking sector. Gained proficiency in Python and PySpark for handling large-scale data processing and analysis. Demonstrated extensive knowledge in managing and querying databases using PostgreSQL, DB2, and Oracle.
Data Scientist and Data Engineer
8/1/2022 - 3/1/2023
Contributed to a B2B and Global Marketing team by extracting and manipulating large-scale, high-volume databases using PySpark. Conducted the entire ETL process, data mining, modeling, and business analytics, providing insights into operations performed by millions of customers across various sales segments. Supported business teams in making informed strategic decisions based on reliable data. Demonstrated expertise in AWS Cloud solutions, including the development of data ingestion pipelines and extraction of data via APIs such as Salesforce and SolucX. Applied mathematical concepts such as statistical tests and distributions, and worked with an array of programming languages and tools including Python, PySpark, Glue, Lambda, Databricks, AWS S3, SQL Server, and GitLab. Recognized patterns in data and detected consumption profiles by geographic region through modeling techniques such as Logistic Regression, Bayesian Learning, Random Forest, and LightGBM. Transformed large amounts of unstructured data into usable formats and managed the deployment and updating of dashboards.
Data Analyst
12/1/2021 - 8/1/2022
Part of the Data Analytics team supporting the development of data analysis in the innovation ecosystem, focusing on mapping Open Innovation programs of large Brazilian companies with startups to understand their partnerships and innovation endeavors. Utilized computational tools for data analysis, non-relational database manipulation, and advanced analytics techniques. Developed thorough documentation of the company's database, detailing collections within MongoDB. Modeled data employing Logistic Regression, Bayesian Learning, and Random Forest techniques, analyzing innovation demands of large client companies. Emphasized storytelling through insights and key performance indicators (KPIs), and implemented and maintained dashboards. Demonstrated technical proficiency with programming languages and tools including NoSQL, SQL, Python, and MongoDB.
Applied and Computational Mathematics at Federal University of Sergipe
2013 - 2017
Computational Modeling at Federal University of Paraíba
2018 - 2020
SQL for Data Science at dnc.group
2/1/2022
Scrum Fundamentals at dnc.group
2/1/2022
Python Zero at dnc.group
1/1/2022
Big Data - Business at Semantix
12/1/2021
Journey to the Legendary Dashboard at Mamba Treinamentos
10/1/2021