Sunday, October 26, 2008

diamonds

Diamond

This data set is a sampling of 617 round shape diamonds collected from retail website in December 2007. The data includes the following variables for each diamond: Price, Carats, Clarity, Color, Cut, ClarityCode, ColorCode, CutCode.

Source

Summary

  • 617
    8

Download Source Data

Current Snapshot

Previous Updates

Data Summary

Showing last 6 rows and first 4 columns
Price Carats Clarity Color Cut ClarityCode ColorCode CutCode
Clear Clear Clear Clear Clear Clear Clear Clear
$2,177.00 0.5 VVS1 F Ideal 6 5 3
$3,361.00 0.78 VVS2 G Ideal 5 4 3
$1,795.00 0.51 VS1 F Ideal 4 5 3
$3,023.00 0.7 VS1 F Ideal 4 5 3
$9,821.00 1.26 VS2 F Ideal 3 5 3
$2,137.00 0.82 SI2 J Ideal 1 1 3
more... more... more... more... more... more... more... more...

Recent Comments

robschoen says

I used this data to do a regression model for estimating the value of a round shaped diamond based on its weight, clarity, color, and cut.

The model I came up with was:

log(price) = 7.784 + 2.032*log(carats)
+0.113*clarityCode + 0.102*colorCode + 0.030*cutCode

This model has an adjusted R-squared of .96. The remaining variance is possibly explained by some additional factors not included in the dataset, such as symmetry, polish, and fluorescence. I tried using dummy variables for each of the codes, but was surprised how little this increased the model's explanatory power.

It's a log-model, so the errors are larger in absolute dollars when estimating more valuable dollars, but the percentage size of the errors is fairly consistent.

posted 10 months ago

Popular Graphs

Column Summaries

Show columns: 1 - 4 5 - 8
Price
Clear

$9,956.00
$4,578.74
$1,000.00
$2,825,080.00
2514.25
Carats
Clear

1.72
0.875737
0.43
540.33
0.27
Clarity
Clear
Count of Clarity

see all graphs

7
IF … VVS2
Color
Clear
Count of Color by Color

see all graphs

7
D … J

No comments: