Skills
Languages: Python, R, Bash, Unix Systems
I am most proficient in Python and use it consistently in my projects. I am also familiar with R and Bash scripting. In computer science, I have a strong understanding of data structures, searching and sorting algorithms, and time and space complexity.
Machine Learning: TensorFlow, Scikit-Learn
For building models, I mainly use TensorFlow, for everything from custom neural networks to simple regression and classification models. I also make good use of Scikit-Learn. Its summary and results-table features are handy for assessing how my models perform, which helps me fine-tune them.
Data Visualization: Tableau, Matplotlib, Seaborn
Data visualization is a part of the data science project life cycle that I particularly enjoy. I have extensive experience in Tableau, from building interactive dashboards to creating custom filters, fields, parameters, and actions. When working in Python, I mostly use Seaborn, Matplotlib, and Plotly. In R, I am most comfortable with ggplot2.
Data Cleaning: Pandas, NumPy
During a project, I spend a large share of my time cleaning and organizing data, and I rely on Pandas and NumPy throughout those stages. These tools also come in handy during my initial investigation of the data. They help me discover patterns, test hypotheses, and check assumptions using summary statistics and graphical representations.
Databases: SQL, MongoDB, Apache Spark
I've developed sophisticated Oracle SQL backends for Tableau reports, improving performance through custom indexing, partitioning, and reduced duplication. I also have experience with MongoDB and Apache Spark, focusing on unstructured data management and efficient big data processing.
Version Control Systems: GitHub, Virtual Environments