jessamynsmith/DOL-ILAB

DOL-ILAB

Data manipulation for the Bureau of International Labor Affairs

The development language is Python 2.7.9. The core data standard used is XML. MySQL is the database used.

The mission of this project is to convert the data provided by the Office of Forced and Child Labor (in DOL), hereafter called OFCL, to machine-readable forms.

The data provided by OFCL is:

  • Three Excel spreadsheets, which contain 1) goods, 2) products, and 3) master data for the countries monitored by OFCL
  • Around 150 Word documents that describe the countries that the office monitors

The difference between goods and products is a legislative one. Products are defined as those things that the Bureau is 100% sure involve forced labor.

The sample code in the code folder processes the data in goods.xls (and products.xls) into an array of goods (and products).

generate_good.py converts an Excel spreadsheet into an array, goods_list. It also creates XML files in the output folder (ordered by country or by good) and produces a raw JSON dump of the spreadsheet.
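As a rough illustration of the array and JSON steps, here is a minimal sketch. It is not the code in generate_good.py: the rows, the field names, and the helper functions `rows_to_goods_list` and `ordered_by` are all hypothetical, and the rows are hard-coded so the example runs without a spreadsheet reader.

```python
import json
from operator import itemgetter

# Hypothetical rows, standing in for what a spreadsheet reader would return
# from goods.xls: (year, country, good, child labor flag, forced labor flag).
ROWS = [
    (2014, "Country B", "Cotton", "Yes", "No"),
    (2014, "Country A", "Bricks", "Yes", "Yes"),
    (2014, "Country A", "Cotton", "No", "Yes"),
]

# Assumed column names; the real spreadsheet headers may differ.
FIELDS = ("year", "country", "good", "child_labor", "forced_labor")


def rows_to_goods_list(rows):
    """Turn each spreadsheet row into a dict keyed by column name."""
    return [dict(zip(FIELDS, row)) for row in rows]


def ordered_by(goods_list, *keys):
    """Return a new list sorted by the given keys, e.g. country then good."""
    return sorted(goods_list, key=itemgetter(*keys))


goods_list = rows_to_goods_list(ROWS)
by_country = ordered_by(goods_list, "country", "good")
by_good = ordered_by(goods_list, "good", "country")

# Raw dump of the rows as JSON, with sorted keys for readable diffs.
raw_json = json.dumps(goods_list, indent=2, sort_keys=True)
```

The two orderings correspond to the "by country" and "by good" outputs; only the sort keys change between them.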

generate_product.py does the same thing as generate_good.py but for products.

generate_master_data performs the same conversion, from spreadsheet to internal structure and XML, for the file named Master_Data_Table_for_TDA2014_30_jan_2015.xlsx

generate_db generates a MySQL database and dumps the information from the spreadsheets into it. This is ongoing work.
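The database step might look something like the sketch below. To keep the example runnable without a server, Python's built-in sqlite3 module stands in for MySQL. The table name, column names, and sample records are assumptions, not the schema used by generate_db.

```python
import sqlite3

# Hypothetical goods records:
# (year, country, good, child labor flag, forced labor flag).
GOODS = [
    (2014, "Country A", "Bricks", 1, 1),
    (2014, "Country A", "Cotton", 0, 1),
    (2014, "Country B", "Cotton", 1, 0),
]

# sqlite3 stands in for MySQL here; table and column names are assumptions.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE goods (
        year INTEGER NOT NULL,
        country TEXT NOT NULL,
        good TEXT NOT NULL,
        child_labor INTEGER NOT NULL,
        forced_labor INTEGER NOT NULL
    )
    """
)

# Parameterized inserts keep spreadsheet text from being interpreted as SQL.
conn.executemany("INSERT INTO goods VALUES (?, ?, ?, ?, ?)", GOODS)
conn.commit()

# Example query: every (country, good) pair flagged for forced labor.
forced = conn.execute(
    "SELECT country, good FROM goods WHERE forced_labor = 1 "
    "ORDER BY country, good"
).fetchall()
```

With a MySQL driver the flow is the same, but the placeholder is typically `%s` rather than `?`.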

XML Formats are:

-- Products By Country --

<Product_List>
	<Year>
		<Year_Name></Year_Name>
		<Country>
			<Country_Name></Country_Name>
			<Products>
				<Product_Name></Product_Name>
			</Products>
		</Country>
		...
	</Year>
	...
</Product_List>
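The closing </Products>, </Country>, and </Year> tags in the Products By Country layout suggest a nesting of Year > Country > Products. Under that assumption, a file in this shape could be built with the standard library roughly as follows. The wrapper elements, the sample records, and the function `products_by_country` are hypothetical and are not taken from generate_product.py.

```python
import xml.etree.ElementTree as ET
from collections import OrderedDict

# Hypothetical (year, country, product) records; real values would come
# from products.xls.
RECORDS = [
    (2014, "Country A", "Bricks"),
    (2014, "Country A", "Cotton"),
    (2014, "Country B", "Cotton"),
]


def products_by_country(records):
    """Build a Product_List tree nested as Year > Country > Products."""
    # Group products under year, then country, keeping first-seen order.
    grouped = OrderedDict()
    for year, country, product in records:
        countries = grouped.setdefault(year, OrderedDict())
        countries.setdefault(country, []).append(product)

    root = ET.Element("Product_List")
    for year, countries in grouped.items():
        year_el = ET.SubElement(root, "Year")
        ET.SubElement(year_el, "Year_Name").text = str(year)
        for country, products in countries.items():
            country_el = ET.SubElement(year_el, "Country")
            ET.SubElement(country_el, "Country_Name").text = country
            products_el = ET.SubElement(country_el, "Products")
            for product in products:
                ET.SubElement(products_el, "Product_Name").text = product
    return root


tree = products_by_country(RECORDS)
xml_text = ET.tostring(tree).decode("utf-8")
```

Swapping the two grouping levels (product first, then country) would give the by-product variant.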

-- Products By Product --

<Product_List>
	<Year_Name></Year_Name>
	<Product_Name></Product_Name>
	<Country_Name></Country_Name>
	...
</Product_List>

----- GOODS

-- Goods By Country --

<Good_List>
	<Year_Name></Year_Name>
	<Country_Name></Country_Name>
	<Good_Name></Good_Name>
	<Child_Labor></Child_Labor>
	<Forced_Labor></Forced_Labor>
	...
</Good_List>

-- Goods By Good --

<Good_List>
	<Year_Name></Year_Name>
	<Good_Name></Good_Name>
	<Country_Name></Country_Name>
	<Child_Labor></Child_Labor>
	<Forced_Labor></Forced_Labor>
	...
</Good_List>

-- Master Data --

<Master_Data>
	<Year_Name></Year_Name>
	<Survey_Name></Survey_Name>
	<Childrens_Work_Statistics>
		<Survey_Source></Survey_Source>
		<Age_Range></Age_Range>
		<Total_Child_Population></Total_Child_Population>
		<Total_Percentage_of_Working_Children></Total_Percentage_of_Working_Children>
		<Total_Working_Population></Total_Working_Population>
	</Childrens_Work_Statistics>
	<Education_Statistics_Attendance_Statistics>
		<Age_Range></Age_Range>
	</Education_Statistics_Attendance_Statistics>
	<Children_Work_And_Studying>
		<Age_Range></Age_Range>
	</Children_Work_And_Studying>
	<Unesco_Primary_Completion_Rate></Unesco_Primary_Completion_Rate>
	...
</Master_Data>
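To show how a consumer might read the Master Data format, here is a small sketch that pulls the Childrens_Work_Statistics fields out of a document in that layout. The tag names come from the format above. The sample values, the assumption that the numeric fields hold plain numbers, and the function `read_work_statistics` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in the Master Data layout; every value is made up.
SAMPLE = """<Master_Data>
  <Year_Name>2014</Year_Name>
  <Survey_Name>Example Survey</Survey_Name>
  <Childrens_Work_Statistics>
    <Survey_Source>Example Source</Survey_Source>
    <Age_Range>5-14</Age_Range>
    <Total_Child_Population>1000</Total_Child_Population>
    <Total_Percentage_of_Working_Children>12.5</Total_Percentage_of_Working_Children>
    <Total_Working_Population>125</Total_Working_Population>
  </Childrens_Work_Statistics>
  <Unesco_Primary_Completion_Rate>90.0</Unesco_Primary_Completion_Rate>
</Master_Data>"""


def read_work_statistics(xml_text):
    """Extract the children's work statistics as a plain dict."""
    root = ET.fromstring(xml_text)
    stats = root.find("Childrens_Work_Statistics")
    return {
        "year": root.findtext("Year_Name").strip(),
        "age_range": stats.findtext("Age_Range").strip(),
        # Assumes the numeric fields contain plain numbers, no separators.
        "child_population": int(stats.findtext("Total_Child_Population")),
        "percent_working": float(
            stats.findtext("Total_Percentage_of_Working_Children")
        ),
        "working_population": int(stats.findtext("Total_Working_Population")),
    }


work_stats = read_work_statistics(SAMPLE)
```

Real exports may leave fields empty or use text such as "Unavailable", in which case the numeric conversions would need guarding.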
