This seems odd to me... that would mean that a math copro for a 486 would cost more then an actual 486 sx chip, no? Was this actually the case?
what? do you think that the 486SX was different from the 486DX? (well later it was but...)
in chip manufacturing, most chips have flaws, but much/most of the chip exists in duplicate, so if one part is flawed, another is taken
with the 486, intel originally made only one chip, if the FPU was broken, instead of throwing the chip away, they simply disabled it and sold it as a 486SX, then for the 487 they just add a full 486DX and disabled the original chip (many hobbyists expiremented with re-enabling the original SX chip, to give them a dual-CPU system)
this technique allowed them to design and manufacture only a single chip design, which not only reduces the significant cost of design, it also gives extra advantage in scale -- producing more identical chips, reduces the per-chip cost
however, late-era chips were separate -- the 487 was rebuilt to stop attempts to re-enable the SX chip, and the SX chip was later redesigned to eliminate the FPU, reducing the die size, but by that time, most people were just getting the DX anyway as prices had been reduced
the same thing is also done (by both intel and AMD) with clock speeds -- the chips are made then tested under extreme conditions, and the best chips are labeled as higher clock speeds, and iirc the same is done with cache on newer chips -- most of the transistors in modern chips are actually part of the cache, and when a flaw appears in the cache, part of it is disabled, and its sold as a cheaper model with less cache (actually, itanium is designed to auto-disable cache if it goes bad and the chip keeps running, so you never know how much cache you are getting when you buy the chip) this is also how the original celeron came to be -- disable the cache on a PII/PIII and you have a celeron