Welcome to the Random musings of a data guy 🤖
- Things I generally think about:
- • data
- • dbt
- • data engineering
- • Data science
- • MLops
Welcome to the Random musings of a data guy 🤖
Understanding how to work with knowledge graphs can give data scientists the ability to not just extract interrelated facts and assumptions from massive collections of data, but can also help in understanding how to form contextual connections and understanding from data via linking and semantic metadata which helps provide a unified approach to data analytics and intelligence. This article is an introduction to knowledge graphs, important concepts and their applications....
Applying the gradient descent algorithm to find the minimum of the following function of four variables Introduction Gradient is an algorithm used to find the parameters that minimizes a function. We will be using the function above to implement gradient descent. To implement a gradient descent algorithm there are some important ingredients to be used which includes; The function The partial derivative of all the function The gradient descent algorithm finds the best path to descend to the minimum points in a function by finding the slope of the tangent of the function....
Abstract Generative Adversarial Networks (GAN) have seen great improvement in modelling synthetic data with results unparalleled in the field of deep learning. This research explores data augmentation for cover song identification by extending state-of-the-art GAN frameworks used in image processing to cover song identification. Research in this domain is still in the early phases, this work explores the implementation of a GAN model capable of generating new features for cover song identification....
Abstract Understanding how to work with breast cancer data to aid the early detection of breast cancer in women is very important to the health and wellbeing of women around the world. This study explores various statistical methods and techniques to analyze breast cancer related dataset, to discover if common statistical methods can be used to analyze these datasets. The impact of breast cancer on the well being of women provokes the need for both accurate and interpretable results....
The code above is to help format the notebook and make the outputs appear inline and within the document. 1.0 Data Preprocessing This is to load all the modules to be used in this analysis. The os module provides a way to interact with the Os of the machine, the glob module is used to retrieve the path names needed. Pandas,Numpy, Seaborn, Matplotlib.pyplot and Folium are modules used to interact with the dataset and plot visualizations....
Abstract The high impact of road transport on the environment calls for both scientific and social policies to control carbon and greenhouse gas emissions. The goal of this research is to study cars and taxis as a top contributor to CO2 emissions and reiterate the importance of putting policies in place to reduce car use. The CO, CO2 and greenhouse gas emissions were assessed in relation to local, national and International emissions....
Loading all packages needed using the package manager Package manager is a package that helps installs and load r packages when needed without having to call each package individually using the library function. #loading package manager require(pacman) ## Loading required package: pacman #using p_load function to load all packages used pacman::p_load(tidyverse,tidytext,stringr,tidytext,textdata,stats, reshape2,modeltools,topicmodels,tm,widyr,anytime,rtweet, leaflet,lubridate,lutz,glue,scales,twitteR,wordcloud,igraph,ggraph) Problem statement Four tasks were performed on the airline tweet dataset provided including Text mining, Sentiment Analysis, Topic Model and Additional Exploratory Analysis....
Abstract Forecasting temperature by the use of advance statistical tools is very important in understanding and dealing with the effects of rising or decreasing temperatures. This study uses Time series analysis and predictions, a statistical methods to analyze and forecast temperatures, by using the average max, min and mean temperature for each month of different regions in the United Kingdom measured by the Met Office. The increase in concerns about the effects of rising or decreasing temperature on humans, animals, the climate, oceans, and seasonal patterns provokes the need for accurate models for forecasting the temperatures of the different regions across the globe....