Sampling and Weighting with DHS Data

12 thoughts on “Sampling and Weighting with DHS Data”

Seman Kedir Osman

May 15, 2017 at 7:14 am

Dear
I still have confusion in how to weighting DHS sample. I also had poor internet access in Ethiopia to follow the videos in examples. How would you help me in order to weight the Ethiopian 2011 DHS.

Reply
- The DHS Program
  
  May 15, 2017 at 12:22 pm
  
  Hello Seman, the DHS user forum (http://userforum.dhsprogram.com/) is a great way to ask questions and receive feedback from the broader community and DHS Program staff. We also have the DHS sampling manual (http://dhsprogram.com/publications/publication-dhsm4-dhs-questionnaires-and-manuals.cfm) as an additional resource. Hope this helps!
  
  Reply
Sumonkanti Das

June 8, 2017 at 6:24 am

I wish to calculate mean of height-for-age z-score (HAZ) and their standard error considering the sampling technique. The results I want to estimate by small areas like sub-district/district. I am using 2011 BDHS data and trying to calculate mean HAZ and mean of (HAZ < -2.00 SD) according to sub-district.
I am using STATA command as below. Fortunately we get reasonable results for district but unrealistic results by sub-district, particularly for small size sub-district. For some sub-districts, the mean HAZ becomes zero with zero standard error and similar is observed for HAZ= 601.

gen HW70n= HW70/100.

g COSUBDIST= CODIST*100+ COTHANA

svyset [pw= V005_rewtd], psu (V001) strata (V023)

univar HW70n, by (COSUBDIST)

tabstat HW70n , by(COSUBDIST) stat(n, mean semean)

# HAZ < -2.0

g HW_2=0.
replace HW_2=1 if HW70n <= -2.
tabstat HW_2 , by(COSUBDIST) stat(n, mean semean)

# Results are like theses.

OSUBDIST | N mean se(mean) sd
———-+—————————————-
108 | 18 .0555556 .0555556 .2357023
114 | 15 .0666667 .0666667 .2581989
134 | 14 .0714286 .0714286 .2672612
156 | 18 .1666667 .0903877 .3834825
160 | 17 .1176471 .0805474 .3321056
177 | 18 .0555556 .0555556 .2357023
373 | 16 .125 .0853913 .341565
409 | 25 .16 .0748331 .3741657
428 | 32 .03125 .03125 .1767767
447 | 8 0 0 0
485 | 11 .0909091 .0909091 .3015113
602 | 8 0 0 0
603 | 11 0 0 0
607 | 21 .1904762 .0878052 .4023739
610 | 23 .0434783 .0434783 .2085144
632 | 12 .0833333 .0833333 .2886751
636 | 15 .1333333 .0908514 .3518658
651 | 107 .0747664 .0255462 .2642517
662 | 19 .1578947 .085947 .3746343

Can you explain why I get such results of zero? Can I do such spatial analysis in such way?

Regards,
Sumon

Reply
- The DHS Program
  
  June 8, 2017 at 9:49 am
  
  Great question! I recommend posting this in our user forum. The forum is regularly monitored to ensure all questions are answered by our knowledgeable staff.
  
  Reply
KA

September 4, 2018 at 10:12 am

Hello DHS Program!
How to deal with non-response, when an entire cluster is dropped (for instance due to security or inaccessibility or bad data)? Shall that cluster be included in household response rate? why? how?

Why such this in not discussed in the internet?

Reply
NN

April 4, 2019 at 2:00 am

Hello!

This question is not in reference to DHS but I’m hoping to get help from someone through this forum.

How can we use weights on a data set that is originally representative at provincial level and make it representative at district level?
Since the variables I’m using are at a district level, using a provincially representative data set will result in a sampling bias. So I’m trying to somehow weight the data in a way that it becomes representative at the district level.

It would be of immense help if i can get a response for my query.

Reply
- Sally Zweimueller
  
  April 5, 2019 at 2:22 pm
  
  This is a great question for The DHS Program User Forum. Visit the “Weighting Data” thread to post your question or look for help. https://userforum.dhsprogram.com/index.php?t=thread&frm_id=33&
  
  Reply
Joshua

April 14, 2020 at 5:48 am

Is it okay to apply svy in multi country DHS data analysis

Reply
- The DHS Program
  
  April 20, 2020 at 2:31 pm
  
  This is a great question for The DHS Program User Forum. Visit the “Weighting Data” thread to post your question or look for help. https://userforum.dhsprogram.com/index.php?t=thread&frm_id=33&
  
  Reply
Anna Belli

March 17, 2021 at 4:19 am

Is it possible to have a different total (of in-migrants for instance) when using or not the weights? I am using the HH weights and the PR dataset in order to find the number of in-migrants moving from their place of birth to their place of residence. But, using the weights, the number of total in-migrants is different from the one without the weigths.

Reply
Amanuel

May 21, 2021 at 6:40 am

Hello !
How can I weight for three merged surveys of a country (ZDHS) to calculate U5M using STATA and R commands?

Regards,
Amanuel

Reply
- Sally Zweimueller
  
  July 1, 2021 at 2:35 pm
  
  This a great question for The DHS Program User Forum: https://userforum.dhsprogram.com.
  We also recommend watching our GitHub tutorial video playlist: https://www.youtube.com/watch?v=Q_FZogyugmI&list=PLagqLv-gqpTMf-DP0QyGOqklG0n5pz9AJ.
  Or check out our tutorial video series on sampling & weighting: https://www.youtube.com/watch?v=DD5npelwh80&list=PLagqLv-gqpTN8IZQBy7vAYw10NjynAn2Z.
  
  Reply

Sampling and Weighting with DHS Data

Written by: Mahmoud Elkasabi

Author

12 thoughts on “Sampling and Weighting with DHS Data”

Leave a Reply Cancel reply