7 min to read
The Price of Choice
Maximizing profit in property sales
A data analyst should have the skills to perform comprehensive analysis, covering tasks such as data cleaning, exploratory data analysis (EDA), hypothesis testing, and regression analysis. These analyses lead to valuable insights and business recommendations. In the RevoU course, I had the opportunity to demonstrate these skills in a project, which further enhanced my proficiency in using spreadsheets as a data analyst.
This project was completed as part of the assignment for the RevoU Full Stack Data Analytics course.
In this project, all the analyses were performed using XLMiner Analysis ToolPak Add-on in Google Sheets.
Business understanding
Background
The ABC Company, a property listing company in Malaysia, aims to offer users and tenants a wide range of property options while maximizing their profits. They adopt a 20% joint-profit sharing model, where properties with higher prices generate a greater percentage of revenue. However, selling high-priced properties with numerous rooms or large sizes can be difficult.
Dataset overview
The dataset contains 5,000 property listings in Malaysia. You can access the spreadsheet here, which includes the datasets, data dictionary, and analysis results.
The dataset includes various numerical variables, like size
, rooms
, bathrooms
, and car parks
, that provide insights through descriptive and inferential statistics. They help us understand the overall size of properties, which can be valuable information for properties that may be challenging to sell.
On the other hand, categorical variables such as location
, property type
, property character
, and furnishing
focus on gaining insights into demand based on location and characteristics. These insights can be particularly helpful in selling high-priced properties because certain locations or property types, such as strategic areas or those with high land values, may make users and tenants willing to compromise on size or number of rooms.
Key ideas
To effectively analyze the data, there are a few key steps to follow:
- Data cleaning
- Exploratory data analysis
- Statistical measurements
- Recommendations
Cleaning up the data
First, it’s essential to clean up the data, ensuring its quality and removing any inconsistencies. To be precise, I have performed the following tasks for cleaning up the data:
- Removed duplicates.
- Trimmed whitespace to remove redundant spaces.
- Uniformed HTML character codes (e.g.,
22’x100’
) to their proper values. - Ensured the correct data type for each column.
- Imputed missing values in the size column using simple linear regression.
- Removed outliers using the Interquartile Range (IQR) method.
These actions resulted in the successful removal of 10.72% of the data.
Exploratory data analysis
Once the data was cleaned, I conducted descriptive statistics to get a clear picture. I specifically focused on the price and size columns. The price
is important because it directly affects the company’s profit, while size
gives insights into property dimensions influenced by factors like the number of rooms, which may potentially affect their marketability.
Price
Count | 4,464 | Standard Deviation | RM1,354,048 |
Minimum | RM408 | Coef. of Variation | 82.66% |
Maximum | RM17,500,000 | Skewness | 2.30 |
Mean | RM1,638,087 | Q1 | RM680,000 |
Median | RM1,230,000 | Q3 | RM2,200,000 |
Mode | RM1,200,000 | IQR | RM1,520,000 |
Range | RM17,499,592 | Lower Limit | -RM1,600,000 |
Variance | 1,833,445,632,479 | Upper Limit | RM4,480,000 |
Size
Count | 4,464 | Standard Deviation | 12,051 sqft |
Minimum | 17 sqft | Coef. of Variation | 551.16% |
Maximum | 790,000 sqft | Skewness | 62.91036701 |
Mean | 2,187 sqft | Q1 | 1,067 sqft |
Median | 1,540 sqft | Q3 | 2,499 sqft |
Mode | 1,650 sqft | IQR | 1,432 sqft |
Range | 789,983 sqft | Lower Limit | -1,081 sqft |
Variance | 145,237,347 | Upper Limit | 4,646 sqft |
Through descriptive statistics, it was found that both the price
and size
variables had a positively skewed data distribution. Therefore, the median will be used as the measure of central tendency for these variables.
Priority properties
Next, let’s explore the property characteristics and split them into groups based on the median price of each property location: luxury properties, which belong to the higher price range (Q3-Q4), and affordable properties, which fall inside the lower price range (Q1-Q2). This approach takes into consideration the market values of different locations, which are assumed to provide a more accurate representation of the land values.
location | M price | # |
ADIVA Desa ParkCity | RM2,400,000 | 1 |
Ampang Hilir | RM2,929,600 | 52 |
Bangsar | RM2,525,000 | 134 |
Bukit Kiara | RM4,988,888 | 4 |
Bukit Tunku (Kenny Hills) | RM2,240,000 | 16 |
City Centre | RM1,780,000 | 57 |
Damansara Heights | RM2,315,000 | 106 |
Desa ParkCity | RM1,900,000 | 316 |
Federal Hill | RM3,300,000 | 3 |
KL Sentral | RM1,800,000 | 63 |
KLCC | RM1,900,000 | 562 |
Mont Kiara | RM1,830,000 | 656 |
OUG | RM1,700,000 | 15 |
SEMARAK | RM3,500,000 | 1 |
Sri Hartamas | RM3,100,000 | 81 |
Taman Melawati | RM1,700,000 | 41 |
location | M price | # |
Alam Damai | RM880,000 | 1 |
Bandar Damai Perdana | RM657,000 | 6 |
Bandar Menjalara | RM838,000 | 23 |
Bangsar South | RM780,000 | 59 |
Batu Caves | RM676,775 | 23 |
Jalan Klang Lama | RM657,500 | 172 |
Kepong | RM798,000 | 179 |
KL City | RM880,000 | 39 |
Pandan Indah | RM828,000 | 5 |
Pantai | RM627,500 | 24 |
Puchong | RM825,000 | 4 |
Segambut | RM820,000 | 58 |
Sentul | RM800,000 | 104 |
Sri Petaling | RM795,000 | 47 |
Titiwangsa | RM625,000 | 8 |
Wangsa Maju | RM710,000 | 78 |
Then, the focus was placed on the top 3 locations with the highest number of listed properties, both luxury and affordable properties. These locations were determined to be priority properties due to their alignment with the company’s mission of providing users and tenants with a diverse range of property options. By considering the number of listed properties as a reliable indicator, the company ensures that users have a wide range of choices in these locations.
Finding the right price
Cum sociis natoque penatibus et magnis dis parturient montes, nascetur ridiculus mus. Aenean lacinia bibendum nulla sed consectetur. Etiam porta sem malesuada magna mollis euismod. Fusce dapibus, tellus ac cursus commodo, tortor mauris condimentum nibh, ut fermentum massa justo sit amet risus.
Recommendations
Cum sociis natoque penatibus et magnis dis parturient montes, nascetur ridiculus mus. Aenean lacinia bibendum nulla sed consectetur. Etiam porta sem malesuada magna mollis euismod. Fusce dapibus, tellus ac cursus commodo, tortor mauris condimentum nibh, ut fermentum massa justo sit amet risus.