Data Regress.

A data science blog documenting learning, projects, concepts, and how-tos of this incredible field.

Page 3 of 4

Apartment Pricing: Advance Regression Techniques

Dubai real estate
Credit: Kate Trysh

Introduction

This project is inspired by a famous Kaggle competition called House Prices: Advanced Regression Techniques.  The original project on Kaggle is based on the Boston Housing dataset and is an ideal project for newbies to hone their skills on.

The original project on Kaggle gives you full opportunity to practice data cleaning, exploratory analysis, and modeling. However, one aspect of the data science project lifecycle was missing, i.e., data scavenging and extraction. The data is already extracted for you, and you don’t know how the information was gathered.

My objective was to achieve the same goals of the original project, but by doing so, using my dataset. Instead of Boston housing, I started looking into the Dubai real estate market and opted to use one of the prominent real estate property portals for my data extraction.

Continue reading

Battle of the Neighborhoods – Dubai Edition

Introduction

For my first data science project, I am presenting a scenario for an entrepreneur who wants to open a Pakistani restaurant in Dubai.

I’ve chosen Dubai first because I’ve spent a fair amount of time there, and I know how Dubai’s localities are segmented. Second, being a melting pot of 200+ nationalities, it offers world-class dining opportunities for people visiting or residing in Dubai.

Due to its world-class status as a tourism destination, Dubai is also the best contender for the Foursquare platform, an essential requirement for this project. 

Continue reading

Let’s talk about Data Science

What is Data Science?

Do you remember when you planned on opening a new branch of your Italian restaurant but couldn’t decide where to? 

Imagine if you had a way to assess and compare different suburbs to identify where most recently, the Italian population started settling! That should be a promising place to open your new branch.

Or how about determining a reasonable and fair price for your house to sell, based on locality, prevailing rates, number of bedrooms, number of baths, or maybe that beautiful garden you have that no one else has!

By the simplest definition, Data Science is a process to extract meaning insight from raw data. 

In 2013, a study discovered that about 90% of all data in the world is generated in the past two years. By estimates, End of the year 2020 will see the size of our digital universe expended to 44 zettabytes.

So, it begs the question, what can we do with all this data? How do we make it useful to us? How can we make use of this data to answer the questions we have and solve real-world problems? All these questions come under the domain of Data Science.

Continue reading
« Older posts Newer posts »

© 2025 Data Regress.

Theme by Anders NorenUp ↑