the digital economy â¢Innovation Union: It consists of over 30 action points aimed at boosting research
initiatives of the Europe 2020 Strategy, has the goal of creating a flourishing digital economy by 2020
The available data indicates that while the FDI sector is of vital importance to the growth of exports
Subdivision of data into North & South Tipperary areas not available 28 Case study: Benefi ts of participation in EU Projects
-refining & Bioenergy, Data Analytics and Manufacturing Research. Two Research Centres are hosted by the University of Limerick:
c) Data pertaining to employment, turnover and exports in the Region from the Central Statistics
A national clustering policy is essential to provide support and structure to cluster development. Additionally, there is a requirement for policies to ensure the creation of the type of
development and the regional benefits of clustering â¢To carry out the necessary prioritisation research and analysis in the Midwest Region to identify
Baseline Data: The initial sections of this Strategy form a baseline assessment of the current research
â¢Priority Area B-Data Analytics, Management, Security & Privacy â¢Priority Area C-Digital Platforms, Content & Applications
the lack of up-to-date data, in particular in developing countries. ITU is joining the international statistical community in looking into ways of using new and emerging data sources â such as those
associated with big data â to better provide timely and relevant evidence for policy-making. Calls for
a âoedata revolutionâ are prominent in the international debates around the post-2015 development agenda, and ICTS have an important role to play in view of their capacity to produce,
huge amounts of data, as well as being a major source of big data in their own right. Big data from
mobile operators, for example, are real-time and low-cost and have one of the greatest development potentials in view of the widespread use and availability of mobile networks and services.
This report provides the reader with a comprehensive and critical overview of the role of big data from the
telecommunication sector, for use in social and economic development policy and for monitoring the future information society
I trust that the data and analysis contained in this report will be of great value to the ITU membership
The 2014 edition of the Measuring the Information Society Report was prepared by the ICT Data and
and Michael Minges to the compilation of data on international bandwidth, revenue and investment. Helpful inputs and suggestions were received
The report includes data from Eurostat, OECD, IMF, Informa, the UNESCO Institute for Statistics, the
ITU also appreciates the cooperation of countries that have provided data included in this report The report was edited by Anthony Pitt and Bruce Granger, ITU English Translation Section.
Chapter 5. The role of big data for ICT monitoring and for development...173 5. 1 Introduction...
5. 2 Big data sources, trends and analytics...175 5. 3 Telecommunication data and their potential for big data analytics...
181 5. 4 Big data from mobile telecommunications for development and for better monitoring...185 5. 5 Challenges and the way forward...
195 Chapter 5 Annex...207 List of references...213 Annex 1. CT Development Index (IDI) methodology...
Annex 2. ICT price data methodology...231 Annex 3. Statistical tables of indicators used to compute the IDI...
5. 1 The five Vs of big data...176 5. 2 An overview of telecom network data...
182 5. 3 Customer profiling using telecom big data...184 xii List of boxes 1. 1 Final review of the WSIS targets:
Achievements, challenges and the way forward...26 1. 2 A decade of successful international cooperation on ICT measurement...
1. 4 What is a data revolution?..30 2. 1 ITU discussion forums on ICT statistics...
5. 1 How big data saves energy â Vestas Wind Systems improves turbine performance...177 5. 2 How Twitter helps understand key post-2015 development concerns...
5. 3 How mobile operators currently use data to track service uptake, business performance and
5. 4 Using mobile data for development...187 5. 5 How mobile network data can track population displacements â an example from the
2010 Haiti earthquake...188 5. 6 Leveraging mobile network data for transportation and urban planning in Sri lanka...
189 5. 7 Poverty mapping in CÃ'te dâ Ivoire using mobile network data...190 5. 8 Using mobile-phone data to track the creditworthiness of the unbanked...
191 5. 9 Using mobile big data and mobile networks for implementing surveys...193 List of tables
1. 1 Rural population covered by a mobile-cellular signal, 2012.4 1. 2 Total Internet domain registrations by world region, 2003,2008 and 2013.18
2. 1 IDI values and changes, 2012 and 2013.41 2. 2 ICT Development Index (IDI), 2012 and 2013.42
5. 1 Sources of big data...175 1 Measuring the Information Society Report 2014 Chapter 1. Recent information society
The data also show a continuous increase in Internet usage, with growth in the number of
and data) market segments, and considering both subscriptions and household access data. This will be followed by a presentation of the latest
trends in terms of investment and revenue in the telecom sector. Then, a number of key indicators
WSIS+10 review, the demand for a data revolution and the role of big data for ICT monitoring
1. 2 The voice market In line with developments in recent years, fixed telephony is on the decline in all regions of the
subscription or SIM CARD â the total numbers and growth rates strongly point to market saturation Whether this will change in the near future, and
ITU World Telecommunication/ICT Indicators database 39.2 26.3 24.9 15.8 12.7 8. 7 1. 3
data reveal, however, that digital divides still exist and that some people are excluded still from access to communication networks
ITU World Telecommunication/ICT Indicators database 162.7 124.7 109.9 108.5 96.4 89.2 69.3 0 20
For countries where data are available, the number of mobile subscriptions far exceeds the number of mobile phone users (Partnership, 2014
many developing countries for which data are available, but is closing with the availability of
Further research and data would be necessary to determine peopleâ s access to, and use of, voice
Partnership (2014) based on ITU data Overall mobile-cellular population coverage %Rural population covered %Rural population
ITU World Telecommunication/ICT Indicators database 27.7 16.7 14.3 9. 8 7. 7 3. 1
The data on fixed-and mobile-broadband uptake confirm what has been observed on the ground. In developed countries, fixed
ITU World Telecommunication/ICT Indicators database 83.7 32.0 21.1 6. 3 0 10 20 30
smartphone, tablet) and SIM CARDS Looking towards the future, the growth potential for mobile broadband looks promising, as
data based on ITU and Telecom Advisory Services calculations more and more countries upgrade their mobile networks. As mentioned earlier, 2g population
Data on 3g population coverage are less available. According to ITU estimates, global 3g population coverage stood at around 50 per cent
New data collected by ITU on the deployment of fibre transmission capacity in countries shows
at the data also reveals major disparities across regions: Asia and the Pacific (in particular China
Partnership (2014) based on ITU data Percentage of rural population covered by at least a 3g mobile
delivering data-intensive applications and services through high-speed networks. While fibre transmission networks constitute an
ITU World Telecommunication/ICT Indicators database connectivity, because of the strong internal demand and also its location:
ITU World Telecommunication/ICT Indicators database 221 420 1'213 702 4'384 11'572
The latest ITU data show that by end 2014, almost 44 per cent of the worldâ s households will have
ITU World Telecommunication/ICT Indicators database 78.0 57.4 53.0 43.6 36.0 35.9 11.1 0 10
data are available, rural household access falls far below urban household access, with differences ranging from 4 per cent (meaning
2014). 8 Available data also show that Internet access in rural households is growing slowly
but data are not readily available for those countries. As has been illustrated earlier, network deployment is still
Data on public access is collected by ITU through its household questionnaire, but only few countries
report data on this indicator, let alone broken down by urban and rural populations. Public access can be provided by commercial facilities
ITU World Telecommunication/ICT Indicators database 78.3 40.4 32.4 8. 0 %0 10 20 30
The data include both global top-level domain (gtld) and country code top-level domain (cctld) registrations
Data sourced from Facebook 0 500 1'000 1'500 2013 2012 2011 2010 2009
International data on ICT access and use by enterprises are collected annually by the United Nations Conference on Trade and Development
In the developing world, data on ICTS in enterprises are scarce and only collected by few
Data supplied by Zooknic, compiled from cctld and other sources. Figures exclude fifteen cctlds which act as virtual
There is little data on the use of ICTS by government organizations, and those countries that do have data are usually the more advanced
ones with high levels of connectivity in general More information is available about government services provided online, tracked by the United
The latest data show that, today, governments of all countries have established central websites and that more
UNCTAD Information Economy Database, 2014, available at unctadstat. unctad. org P e rc e n
Data from United nations E-government Survey (2014 Chart 1. 21: E-government services provided by countries (transactional services, left, and e-participation
Data from United nations E-government Survey (2014 101 73 60 46 44 42 41 40
exchange of forms, data and other information Internet access in schools also achieves cost efficiencies by automating manual tasks and
The latest available data from the UNESCO Institute for Statistics (UIS) 22 show that, in
developed countries for which data are available In developing countries, school access to Internet is lower on average, although much
Data on broadband in secondary schools in Bangladesh are not available. Data for Nicaragua, Philippines and Indonesia do not include upper
secondary. Data for European countries and Costa rica refer to lower secondary. Data for Guyana, Nicaragua and Indonesia refer to primary and lower
secondary. Data for Cambodia include pre-primary schools. Data for Morocco, Tunisia, Guyana, Montserrat, Dominican republic, Nicaragua, Colombia
Trinidad and tobago, Bangladesh, Philippines, Sri lanka, Azerbaijan, Bhutan, Cambodia, Kazakhstan, Malaysia, Maldives, Singapore, Belarus and the Russian Federation refer to public schools.
In Suriname, there are no private schools in upper secondary. Data for Palestine refer to West bank schools
only Source: UIS database, Partnership on Measuring ICT for Development WSIS Targets Questionnaire, 2013 P
e rc e n ta g e o f s c h o o ls
Americas 0 10 20 30 40 50 60 70 80 90 100 Internet Fixed broadband Internet
Data for Philippines refer to primary and lower secondary. Data for Venezuela refer to primary only.
Data for Montserrat and Saint lucia refer to secondary only. Data for Palestine refer to West bank schools only.
Data for Bahrain, Belarus, Morocco and Tunisia refer to 2008. For Morocco, ICT -qualified teachers figures refer to 2008.
Data for Azerbaijan, Barbados, Jordan, Saint lucia, Singapore, Trinidad and tobago, Uruguay, Philippines and Sri Lanka refer to public schools only
Source: UIS database, Partnership Questionnaire on WSIS Indicators, 2014 Proportion of teachers trained to teach basic computer skills (or computing
%P ro p o rt io n o f te a c h e rs
t ra in e d t o t e a c h s u b
Available data collected by UIS at the international level shows that education systems in countries seem to put more emphasis on
document also includes two targets on data and monitoring and stresses the âoeneed to take urgent
availability of disaggregated data to ensure that no one is left behindâ. 26 The role of ICTS as a key development enabler
collect ICT data at the national level Monitoring progress towards achievement of the WSIS outcomes has been an integral component
indicators and collecting data and statistics on ICT. Since its creation in 2004, the Partnership has
national-level data on the indicators identified to measure the WSIS targets. The results of the survey are featured in the report
along with other data sources The Final review of the WSIS targets: Achievements, challenges and the way forward report is available at:
Using different data sources and big data analytics An important element in the discussion related
to the post-2015 development agenda and the setting of measurable goals and targets has
data and statistics are collected, analysed and disseminated, in view of the large data gaps prevailing in many developing countries in basic
statistics in the areas of the economy, health education, labour, etc. all of which are crucial to
see Box 1. 4 on the data revolution. Since then it has received considerable attention within the
development data According to the discussions, a data revolution considers new data sources, in addition to existing official sources:
besides governments other stakeholders such as the private sector, civil 27 Measuring the Information Society Report 2014
Box 1. 2: A decade of successful international cooperation on ICT measurement In 2004, ICT measurement was still in its infancy, with little
calls surfaced for reliable and comparable data in order to take stock of the emerging information society, identify digital
of their national ICT data collection. The methodological work developed by the Partnership has contributed significantly to
Data availability has increased also significantly over the past decade. In particular, there are more comparable data on ICT
infrastructure, household access and Internet users. For example at the beginning of the century, only around a dozen developing countries collected data on Internet users, while today there are
almost 50 developing countries collecting this indicator through official surveys (Chart Box 1. 2). Data on household access to
the Internet or a computer are now being collected by more than 100 economies worldwide, and data on Internet use in
businesses by almost 70 countries, although not on a regular basis (Partnership UNSC 2011. Similarly, whereas no data were
available on ICT access and use in schools, they have started now to be compiled in many developing countries (see section 1. 5
At the same time, major data gaps remain, in particular in developing countries and LDCS. This concerns, notably, statistics
public-sector organizations, ICT-related employment data, as well as data related to online security and cybercrime, gender
and youth, and cultural and environmental aspects. The growing information society will increasingly require more and better
Internet user data through official surveys, by level of development Note: Chart shows countries that have collected data on the number of
Internet users through official national surveys. Data are presented in three-year intervals and include countries that have collected
data for at least one year within those intervals Source: ITU 0 2 11 24 36
49 1 3 27 37 41 40 1 5 38 61 77 89 0 20
New data sources could include big data mostly provided by private-sector companies which could help âoeimprove the timeliness and
completeness of data, without compromising the relevance, impartiality and methodological soundness of the statisticsâ (UNSC, 2014
The topic of big data is gaining momentum in the statistical community. Chief statisticians gathering at the UNSC meetings in 2013
recognized that âoebig data constitute a source of information that cannot be ignored by official statisticiansâ and that âoeofficial statisticians must
surveys in a number of countries, big data could provide important sources of more timely and
contributions from Member States. c Due to data limitations, currently mobile-broadband signal coverage is considering in determining this target
d Data being compiled by the Global Cybersecurity Index (GCI Source: ITU 29 Measuring the Information Society Report 2014
potentially becoming big data sources as well At the UNSC meeting in 2014, the commission reiterated its call for the global statistical
the use of big data for official statistics. The commission requested the group to include the
of big data for official statistics at regional subregional and national levels â¢To address the concerns of methodology
data and legislation related to big data â¢To address the issue of obtaining âoeaccess at no costâ to big data from the private sector
for official statistical purposes, as well as the issue of access to transborder data or access to data on transboundary phenomena
â¢To develop guidelines to classify the various types of big data sources and approaches â¢To develop methodological guidelines
related to big data, including guidelines for all the legal aspects â¢To formulate an adequate communication strategy for data providers and users on the
issue of use of big data for official statistics â¢To reach out to other communities especially those more experienced
in IT issues or in the use of open data platforms The UN Global Working group on Big data for
Official Statistics was launched formally in June 2014, under the auspices of the UN Statistics Division.
The mandate of the group, of which ITU is a member, includes: provide strategic vision direction and coordination of a global programme
on big data for official statistics; promote practical use of sources of big data for official statistics
provide solutions for methodological, legal and privacy issues; promote capacity building; foster communication and advocacy of the use of big
data for policy applications; and build public trust in the use of private-sector big data for official
statistics ICTS are part of the debate on the data revolution big data and, more broadly, emerging data issues
in the post-2015 development debate. First the ICT sector in itself represents a new source
of data, provided by, for example, Internet and telecommunication companies. Second, the spread and use of ICTS allow public and private
entities across all economic sectors to produce store and analyse huge amounts of data. At the
same time, however, monitoring access to and use of ICTS by people, public entities and private
enterprises will be essential in order to identify the extent to which stakeholders in the ICT sector can be used as an alternative data source.
Without ICTS, no ICT-driven data revolution will take place In view of the link between big data and ICTS, work
is under way in ITU with a view to contributing to the debate and identifying new ways and means
of exploiting the potential of big data. The focus is primarily on the telecommunication/ICT sector
as a source of big data, including players such as operators and service providers, in the fixed
mobile and Internet sectors. Delegates attending the eleventh World Telecommunication/ICT Indicators Symposium (WTIS) in Mexico city in
December 2014 recommended that ITU should further examine the challenges and opportunities of big data, in particular data coming from ICT
companies; that regulatory authorities should explore the development of guidelines on how big data could be produced, exploited and stored
and that national statistical offices, in cooperation with other relevant agencies, should look into the opportunities for big data and address
current challenges in terms of big data quality Chapter 1. Recent information society developments 30 veracity and privacy within the framework of the
fundamental principles of official statistics. 33 The big data approach taken by ITU so far focuses on the following areas and questions
Standardization: 34 â¢Which standards are required to facilitate interoperability and allow technology integration in the big data value chain
â¢Which definitions, taxonomies, secure architectures and technology roadmaps need to be developed for big data analytics and technology infrastructures
â¢What is the relationship between cloud computing and big data in view of security frameworks â¢Which techniques are needed for data
anonymization for aggregated datasets such as mobile-phone records â¢How is exploited big data in different industries; what are the specific
challenges faced; and how can these challenges be addressed through international standards Regulation: 35 â¢What are the key regulatory issues at stake
and how can and should big data be regulated â¢How does big data impact on the
regulation of privacy, copyright and intellectual property rights (IPR transparency and digital security issues Box 1. 4:
What is a data revolution The report of the High-level Panel of Eminent Persons on the
Post-2015 Development Agenda to the UN Secretary-general which was published in May 2013, has called for, inter alia, a data
revolution taking advantage of new technology and improved connectivity: âoewe also call for a data revolution for sustainable
development, with a new international initiative to improve the quality of statistics and information available to people and
levels on what such a data revolution could entail and how it could be implemented. While no internationally agreed concept
part of a data revolution: 32 â¢In view of the ubiquitous availability of communication networks, the use of new information technologies (e g
the collection and dissemination of data â¢Data should be further disaggregated (by gender income, age, geography, etc.
to ensure that no one is left out; in this regard, traditional statistical processes should be made more efficient
â¢The focus should go beyond data dissemination and also include investment in the development of concepts
â¢New, nontraditional data sources should be explored and leveraged to complement existing ones and satisfy the demand for data needs in new areas, such
as big data, geospatial information and geographical information systems â¢Open data policies should be envisaged to ensure
accountability and promote transparency â¢The role of data, statistics and monitoring for policy -making and decision-making should be increased
31 Measuring the Information Society Report 2014 â¢What is the link between big data and open data (crowdsourcing, cloud
computing, etc â¢is there need a to regulate data management and service providers â¢How can market dominance in the area
of big data be prevented and the rights of the data owners protected ICT data collection and analysis
â¢How can big data complement existing ICT statistics to better monitor information-society developments â¢Which type of data from ICT companies
are most useful and for which purposes â¢Which new ICT indicators could be produced from big data sources
â¢What are the key issues that need to be addressed, and by whom, in terms of
collecting and disseminating big data in telecommunications â¢What is the role of national statistical offices and how can big data
complement official ICT data â¢How can big data from telecommunications inform not only ICT but broader development policy in
real time, leading to prompt and more effective action Chapter 5 of this report addresses some of
these questions and provides suggestions and recommendations for the way forward Chapter 1. Recent information society developments
32 1 Refers to countries where fixed-telephone penetration increased by more than 1 per cent in 2014
2 See https://gsmaintelligence. com /3 http://www. censusindia. gov. in/2011census/hlo/Data sheet/India/Communication. pdf
4 4g refers to fourth-generation mobile network or service. It is a mobile-broadband standard offering both mobility and very
high bandwidth, such as long-term evolution (LTE) networks (ITU Trends 2014 5 Data collection on Europe and North america will follow in 2014
6 For a list of IXPS, see for instance http://www. datacentermap. com/ixps. html 7 For more details on international submarine fibre-optic links, see Telegeographyâ s Submarine cable Map 2014, available at
http://submarine-cable-map-2014. telegeography. com 8 For further discussion on progress made towards connecting rural households to the Internet,
IMF World Economic Outlook Database, April 2014 12 Source: The Economist, April 12 2014, âoenigeriaâ s GDP step changeâ
19 Data refer mostly to the year 2011 20 The UN E-government Development Index is a composite benchmarking indicator based on a direct assessment of the state
elements that should be part of a data revolution 33 See final report of WTIS-13, available at:
34 For further information on the work on big data carried out by the ITU Telecommunication Standardization Bureau (TSB),
http://www. itu. int/en/ITU-T/techwatch/Pages/big data-standards. aspx 35 A background document on big data that was prepared for GSR-14 is available at
http://www. itu. int/en/ITU-D/Conferences/GSR/Pages/gsr2014/default. aspx 35 Measuring the Information Society Report 2014
better data become available. For example what was considered basic infrastructure in the past â such as fixed-telephone lines â is fast
â¢Data availability and quality. Data are required for a large number of countries as the IDI is a global index.
There is relative paucity of ICT-related data, especially on ICT usage, in the majority of developing
countries. In particular, the three indicators included in the skills sub-index should be considered as proxies until
data directly relating to ICT skills become available for more countries â¢The results of various statistical analyses.
data and to explore whether the different dimensions are statistically well-balanced While the basic methodology has remained the
related data availability, the indicators included in the IDI and its sub-indices are under regular
field of ICT statistics and data collection, work through online discussion forums and annual face-to-face meetings.
-SIM ownership is driving up mobile-cellular subscription numbers, which is an issue in prepaid and, to a lesser extent, also in postpaid
data with data based on national household surveys (demand-side indicators. An indicator such as the percentage of individuals using a
reported these data to ITU for at least one year between 2011 and 2013. It is therefore too
subscription data in the IDI with mobile-phone user data. In view of the methodological difficulties in collecting harmonized data on
international Internet bandwidth, a review of the definition of the indicator is currently under discussion in EGTI
The IDI was computed using the same methodology as in the past, applying the following steps (Figure 2. 2 and Annex 1
â¢Preparation of the complete data set. This step includes filling in missing values using various statistical techniques
â¢Normalization of data. This is necessary in order to transform the values of the IDI indicators into the same unit of
â¢Rescaling of data. The data were rescaled on a scale from 0 to 10 in order to
compare the values of the indicators and the sub-indices â¢Weighting of indicators and sub-indices.
This chapter presents the IDI based on data from 2013 in comparison with 2012. It should be
â¢Country data revisions. As more accurate data become available, countries provide ITU with revised statistics for
previous years, which have been taken into consideration. This also allows ITU to identify inconsistencies and revise
added based on data availability. Overall this version of the IDI includes 166 countries/economies as compared with
According to data from the European union EU), 85 per cent of Danes have some level of
ITU World Telecommunication/ICT Indicators database 33 35 36 38 40 110 75 87 105
ITU World Telecommunication/ICT Indicators database have access to mobile broadband at speeds of at
Data also show that the Republic of korea achieves the highest advertised fixed-broadband speeds, with all
domestic demand for data driven by the high volume of local content, and domestic Internet
Data from the EU confirm that household access is highly correlated with regular use of the Internet, which
Data from the European Commissionâ s Digital Agenda underline the competitiveness of the European fixed
ITU World Telecommunication/ICT Indicators database 0. 0 0. 1 0. 3 1. 0 2. 5
ITU World Telecommunication/ICT Indicators database 91.5 88.1 97.2 96.4 0 10 20 30 40
mobile data plans. 35 During 2013, operators further extended their wireless infrastructure and services throughout Thailand,
Based on the 2013 and 2012 data presented in this chapter, the current (2013) global divide is
Data change very little over time and advances in skills do not show immediate effects
Data not available 59 Measuring the Information Society Report 2014 the profitability of various kinds of economic
selected for their high data availability for a large number of countries Past editions of this report (see MIS 2013) have
Data on urban population, population density, land area and GNI p. c. are sourced from the World bank.
Data on population size are sourced from UNPD.**Correlation is significant at 0. 01 level
pronounced when it comes to data on Internet access and use. Access to the Internet (be it
data are available for both sets of indicators. The following steps were performed â¢First, developed countries were excluded
in respect of which data pertaining to both the MDG indicators and the IDI are available for at least 16 countries. 41 The MDG
147 economies for which data were available for 2002 and 2013 shows that the global IDI value has doubled almost from 2. 52
would require different sets of data (including micro data) collected from official surveys Therefore, the analysis should be considered as a
first step in an attempt to quantify the relationship between ICT performance and MDG progress. The
data are available â¢Almost all MDG indicators that are included under MDG 1, MDG 4, MDG
children, through the analysis of health data collected through public health applications and by serving as a platform for exchange and
of smartphones to capture essential data on the patients and monitor their treatment has accelerated progress.
of MDG indicators, where 2002 and 2011 data are available for both sets of indicators.
data are available for 2002 and 2011 for both the MDG indicators and the IDI
earlier section where 2011 data were analysed improvements in the level of ICT access and use between the ten-year periods have shown
analysis will require different data sets, including micro data on ICT usage collected from official
surveys. Micro data offer analysts and researchers ample information and considerable flexibility to apply quantitative models that identify
relationships and interactions between indicators and topics covered in a survey, thereby fostering the diversity and quality of research and analysis
2 Data on the indicators included in the skills sub-index are sourced from the UNESCO Institute for Statistics (UIS.
http://data. worldbank. org/about/country-classifications/country-and-lending-groups#High income 7 Based on 2011 data:
https://ec. europa. eu/digital-agenda/sites/digital-agenda/files/DAE%20scoreboard%202013%20-%20
19 In these countries, the in-scope population for data on Internet users is aged individuals 16-74
as well as the household access data, excludes transient labourers, which account for a significant proportion of residents in Qatar.
According to data from ICTQATAR, âoetransient labourersâ make up 27 per cent of the overall population 33 http://qnbn. qa/qatar-vision-2030
35 http://www. telecompaper. com/news/thai-operators-reduce-prices-of-smartphone-data-plans--900198
World bank, see http://data. worldbank. org/indicator/NY. GNP. PCAP. CD 39 See: Czernich, N.,Falck, O.,Kretschmer, T. and Woessmann, L. 2009), Broadband Infrastructure and Economic growth
if there is available data for at least one year between 2010 and 2012. The number of countries included in the analysis varies from 16 to 101 developing countries.
since it is the closest year to available MDG data 42 Broadband Commission. Transformative Solutions for 2015 and Beyond a Report of the Broadband Commission Task force
ITU World Telecommunication/ICT Indicators database 0 20 40 60 80 100 120 140 160
Data from South African operators show that not only is wireless -broadband penetration reaching higher levels
consuming more data, indicating an increase in the intensity of usage. MTN reported a growth
of 63 per cent in data volumes in the first half of 2013 and Vodacom reported that on average
users were generating 75 per cent more data traffic per device than a year ago. 5 Wireless
and high levels of multi-SIM ownership (GSMA and Deloitte, 2013. Furthermore, the very high
Data from household surveys show that the actual number of people using a mobile-cellular phone is
ITU World Telecommunication/ICT Indicators database 2012 2013 P e r 1 0 0 i
Data from the annual ICT household survey show that, since 2008, computers have replaced telephones as
ITU World Telecommunication/ICT Indicators database 2012 2013 P e r 1 0 0 i
with typically high rates of multi-SIM ownership Furthermore, markets are very competitive, with a relatively high number of mobile operators.
Data from household surveys collected in a number of CIS countries underline that mobile -cellular penetration, measured as the number
ITU World Telecommunication/ICT Indicators database 2012 2013 P e r 1 0 0 i
Data from the Eurobarometer underlines this finding on average, 92 per cent of European union citizens (the majority of countries in the region
Data on Individuals using the Internet for Eurostat members are sourced from Eurostat. Eurostat collects data for Internet users aged 16-74 years old
Source: ITU World Telecommunication/ICT Indicators database World Developed 0 10 20 30 40 50
60 70 80 90 100 %101 Measuring the Information Society Report 2014 penetration stands at 6 per cent, all European
ITU World Telecommunication/ICT Indicators database 0 10 20 30 40 50 60 70 80
coverage, and the majority of all data traffic is carried by the LTE network. 27 Very high increases
11 Internet user data from Gulf countries are not comparable, as they refer to different populations.
Data from Bahrain and Qatar refer to the overall population, i e. including expatriate/transient workers.
Data from United arab emirates are estimated by ITU based on base data excluding the transient worker population 12 Reported in activated external capacity
13 http://www. ofca. gov. hk/en/industry focus/telecommunications/facility based/infrastructures/submarine cables/index html 14 http://submarinenetworks. com/systems/intra-asia/sjc/sjc-cable-system
/26 Data reported by the country refer to 2012 27 http://www. verizonwireless. com/wcms/consumer/4g lte. html and http://www. telecompaper. com/news/verizon-wireless
data following a harmonized methodology since 2008. Initially, prices were collected for fixed -telephone, mobile-cellular (voice and SMS) and
Since 2012, the data collection has been extended to include mobile -broadband prices. These data have proved to be
useful for the international comparison of ICT prices across more than 160 countries, and for identifying those cases where prices constitute a
They include end-2013 data for each of the three price sets contained in the IPB (fixed-telephone
Based on 140 economies for which 2008-2013 data on fixed-telephone and mobile-cellular prices were available
-broadband service with a 500 MB monthly data allowance. 7 Despite the limitations of comparing
included in the ITU price data collection exercise Growth rates have been stagnating since 2008 and there have been few structural changes in
price data had already fully or partially liberalized their fixed-telephone market in 2008, compared
Based on 140 economies for which 2008-2013 data on fixed-telephone and mobile-cellular prices were available
*Data correspond to the GNI per capita (Atlas method) in 2013 or latest available year adjusted with the international inflation rates.**
ranked because data on GNI p. c. are not available for the last five years
GNI p. c. and PPP$ values are based on World bank data Rank Economy Fixed-telephone sub-basket GNI p. c
*Data correspond to the GNI per capita (Atlas method) in 2013 or latest available year adjusted with the international inflation rates.**
ranked because data on GNI p. c. are not available for the last five years
GNI p. c. and PPP$ values are based on World bank data Rank Economy Mobile-cellular sub-basket GNI p. c
file sharing (less than 1 per cent of total file -sharing traffic was transmitted through mobile networks in 2013) and Internet video (2 per cent
total mobile data traffic was offloaded onto fixed networks in 2013 (CISCO, 2014), highlighting the role that fixed broadband plays in supporting the
which 2008-2013 data on fixed-broadband prices were available Source: ITU 0 50 100
and data on fixed (wired)- broadband subscriptions by speed, which show that a significant share
Based on 143 economies for which 2008-2013 data on fixed-broadband prices were available Source:
Based on 165 economies for which 2013 data on fixed-broadband prices were available Source: ITU
GNI p. c. values are based on World bank data 0. 0 0. 5 1. 0 1. 5
GNI p. c. values are based on World bank data 0 2 4 6 8 10 12
GNI p. c. values are based on World bank data for fixed-broadband uptake in the country.
GNI p. c. values are based on World bank data 0 5 10 15 20 A s
GNI p. c. values are based on World bank data There are 13 countries in the Asia and the Pacific
Indeed, the latest data on international connectivity show that this may remain an issue in Kiribati (45 Mbit/s), Marshall
GNI p. c. values are based on World bank data However, a comparison with other regions shows
*Data correspond to the GNI per capita (Atlas method) in 2013 or latest available year adjusted with the international inflation rates.**
ranked because data on GNI p. c. are not available for the last five years
GNI p. c. and PPP$ values are based on World bank data Rank Economy Fixed-broadband sub-basket
which ITU collects data on prices (Figure 4. 1 operators are adding plans for specific devices
allowing customers to pool the data consumed by different devices in a single subscription. 23
with neither time nor data volume constraints This is the common scheme in a vast majority
include data volume caps, e g. USD 10 for 50 MB per month. Several operators also offer
of time and data volume limitations, e g. USD 5 for one day of use with a maximum of 50 MB
priced on the basis of the data allowance (i e. the data volume in MB included in the plan) and not
the speed. Many operators do not even advertise the speed of the mobile-broadband service, but
in data allowances, bundled voice minutes and SMS, time limitations, premium speeds, etc rather than actual differences in prices for the
data allowance was PPP$ 36.6 (or USD 24.4) for prepaid plans and PPP$ 30.0 (or USD 19.2) for
-broadband service with 500 MB monthly data allowance was PPP$ 25.3 (or USD 16.9) for
the monthly data allowance was half as large Nevertheless, the reduction in price was not
proportional to the reduction in the data allowance, confirming that the price per GB is
lower for larger data allowances, the equivalent of a volume discount. Unlike in the case of
Based on 119 economies for which data on mobile -broadband prices were available for the four types of plans
data allowance of 500 MB are about eight times more affordable in developed countries than in developing countries, on average (Chart
service with 1 GB monthly data allowance corresponded to more than 20 per cent of GNI
Based on 119 economies for which data on mobile -broadband prices were available for the four types of plans
Based on 119 economies for which data on mobile -broadband prices were available for the four types of plans
much less than 500 MB of Internet data per month, supported by the fact that several African
limited data allowances, and even Internet radio would need to be limited This suggests that, if mobile broadband is to
Based on 119 economies for which data on mobile -broadband prices were available for the four types of plans
Country data for The americas reveal that there are a number of countries which have high prepaid computer
include a monthly data allowance much larger than 1 GB (Table 4. 8). This is the case of Chile
Such high monthly data allowances for prepaid mobile-broadband dongles suggest that these services target high-end customers
data allowances in The americas (Table 3. 7 suggesting that postpaid rather than prepaid is the base offer for regular computer-based
-broadband plans with a monthly data allowance of 1 GB suggest that mobile broadband could be
Percentages are calculated on the basis of the total number of countries with data available in each region:
data allowance of at least 1 GB. Although the minimum data allowance is the same, in practice
most fixed-broadband plans allow unlimited data use (Table 4. 4), whereas most computer -based mobile-broadband plans with a minimum
monthly data allowance of 1 GB really do have a cap of 1 GB (Tables 4. 7 and 4. 8
In almost half of the African countries included in the price benchmark, mobile-broadband prices were more than USD 10 cheaper per
*Monthly data allowance (MB as%of GNI p. c. USD PPP$ 1 Austria 0. 13 5. 31 4. 62 48'590 1'024
*Monthly data allowance (MB as%of GNI p. c. USD PPP$ 80 Maldives 2. 08 9. 7 12.6 5'600 700
*Data correspond to the GNI per capita (Atlas method) in 2013 or latest available year adjusted with the international inflation rates.**
ranked because data on GNI p. c. are not available for the last five years
GNI p. c. and PPP$ values are based on World bank data Chapter 4. ICT prices and the role of competition
*Monthly data allowance (MB as%of GNI p. c. USD PPP$ 1 Norway 0. 1 8. 34 5 102'610 500
*Data correspond to the GNI per capita (Atlas method) in 2013 or latest available year adjusted with the international inflation rates.**
ranked because data on GNI p. c. are not available for the last five years
GNI p. c. and PPP$ values are based on World bank data Rank Economy Mobile-broadband prepaid handset-based (500 MB
*Monthly data allowance (MB as%of GNI p. c. USD PPP$ 80 Antigua & Barbuda 2. 58 27.78 33.8 12'910 1'024
*Monthly data allowance (MB as%of GNI p. c. USD PPP$ 1 Austria 0. 13 5. 31 4. 62 48â 590 1
*Data correspond to the GNI per capita (Atlas method) in 2013 or latest available year adjusted with the international inflation rates.**
ranked because data on GNI p. c. are not available for the last five years
GNI p. c. and PPP$ values are based on World bank data Rank Economy Mobile-broadband postpaid computer-based (1 GB
*Monthly data allowance (MB as%of GNI p. c. USD PPP$ 76 Montenegro 2. 89 17.49 28.93 7â 260 3
*Monthly data allowance (MB as%of GNI p. c. USD PPP$ 1 Austria 0. 13 5. 31 4. 62 48â 590 1
*Data correspond to the GNI per capita (Atlas method) in 2013 or latest available year adjusted with the international inflation rates.**
ranked because data on GNI p. c. are not available for the last five years
GNI p. c. and PPP$ values are based on World bank data Rank Economy Mobile-broadband prepaid computer-based (1 GB
*Monthly data allowance (MB as%of GNI p. c. USD PPP$ 76 South africa 4. 82 28.90 51.17 7â 190 1
to making available more data on household and individual economic welfare, as well as its distribution. This section presents a refined
data on the distribution of household income or consumption expenditure are available The objective is to explore how factors such as
Data on household income, on the other hand measure only peopleâ s economic welfare, and do not include the business sector.
Data are collected by national statistical offices by means of household income and expenditure surveys HIES) or household surveys including a module
classify income data by deciles, individuals are placed in ascending order according to the household income attributed to them, and then
availability of products, may be relevant. 34 Data on household consumption expenditure are also obtained from household surveys
international comparisons based on data on household economic welfare, because there are several methodological issues that limit
-level economic data provide information (not available from macroeconomic indicators) on the actual income and expenditure capacity
previously mentioned, these data can be used to obtain a finer-grain indication of the affordability
Data for the United states and Sweden are sourced from the OECD Database on Income Distribution
Data for South africa and Viet Nam are sourced from the World Bankâ s Povcalnet and refer to 2008
4. 18 uses data on income inequality to reveal differences in the affordability of fixed-broadband
Household disposable income based on World Bankâ s Povcalnet data adjusted with ITU estimates on average persons per household
inequality within countries, economic data at the household level also make it possible to determine more precisely the affordability
Household consumption based on World Bankâ s Povcalnet data adjusted with ITU estimates on average persons per household
and expenditure data, the following conclusions can be drawn â¢In more than 85 per cent of countries
for which data are available, the richest 20 per cent of the population can afford
data are available, a basic fixed -broadband subscription still represents more than 5 per cent of household
countries for which data on household income or expenditure distribution are available, fixed-broadband plans remain
Data on household disposable income refer to 2011 or latest year available.**â Lowest 20%â refers to the price divided by the average income of the first
Household disposable income for OECD countries and the Russian Federation based on data from the OECD Database on Income Distribution
on World Bankâ s Povcalnet data adjusted with ITU estimates on average persons per household Fixed-broadband prices as a%of household disposable income%households
Data on household consumption expenditure refer to 2011 or latest year available.**â Lowest 20%â refers to the price divided by the average expenditure
Household consumption expenditure based on World Bankâ s Povcalnet data adjusted with ITU estimates on average persons per household
limited by lack of data on the distribution of household income/expenditure Available data suggest that basic fixed
-broadband plans are affordable for 90 per cent of the population in Jordan and Tunisia, whereas they are less affordable in
with a 500 MB monthly data allowance) as a percentage of disposable household income and
collects price data (Chart 4. 13), and is currently the mobile-broadband service that is available in most
which data are available. This is also the situation in Armenia, Dominican republic and Egypt In these countries, a prepaid handset-based
subscription data show that, in developed countries, handset-based mobile-broadband subscriptions are individual, rather than shared
Data on household disposable income refer to 2011 or latest year available.**â Lowest 20%â refers to the price divided by the average income of the first
Household disposable income for OECD countries and the Russian Federation based on data from the OECD Database on Income Distribution
data adjusted with ITU estimates on average persons per household Prepaid handset-based mobile-broadband prices
Data on household consumption expenditure refer to 2011 or latest year available.**â Lowest 20%â refers to the price divided by the average expenditure
Household consumption expenditure based on World Bankâ s Povcalnet data adjusted with ITU estimates on average persons per household
the household has his/her own SIM CARD with a mobile-broadband plan. In this case, affordability
Data show that handset-based mobile -broadband prices (with a 500 MB monthly allowance) are affordable for a majority of the
Data on equivalized household income and consumption expenditure per decile refer 2011 or latest available year
Equivalized household disposable income for OECD countries and the Russian Federation based on data from the OECD Database on Income
Equivalized household disposable income and consumption for other countries based on World Bankâ s Povcalnet data adjusted with ITU
Data for the Arab States are limited to four countries. In Tunisia and Jordan, individual handset-based mobile-broadband subscriptions
On the basis of the data presented, it can be concluded that income inequality does not only
of comprehensive data series on the prices for these two services, which makes it possible to study
data for developing countries. Quantitative studies are in several cases restricted to samples of EU and OECD countries, or else
regulatory data for up to 144 countries in the period 2008-2013. Including such a large set
on telecommunication data from EU, OECD and specific countries hold true in a global context
the level of detail of the indicators, since data availability and comparability for such a large
Simple averages for 140 economies with available data on fixed-broadband prices and competition for the period 2008-2013
Herfindahl-Hirschman Index (HHI) data sourced from Informa Chart 4. 22: Competition in mobile markets and mobile
Simple averages for 140 economies with available data on mobile-cellular prices and competition for the period 2008-2013
Herfindahl-Hirschman Index (HHI) data sourced from Informa competition and the openness of the market to private and foreign investment
based on panel data regression. This enables us to go beyond descriptive statistics and draw some robust conclusions on the link between
analyses in this section, data from the Regulatory Tracker have been extracted for clusters 1, 2 and 3, and combined into a single value per country
for up to 144 countries based on data for the five-year period from 2008 to 2013
Panel data regression is a statistical technique which is used to assess how variations in a set
GB of data allowance included per month However, entry-level fixed-broadband plans in several countries offer higher
speeds and larger data allowances. In order to measure how these enhanced features affect prices, a variable on fixed
capped data allowances are included as controls in the model for fixed-broadband prices Chapter 4. ICT prices and the role of competition
Data collected by ITU, see Annex 2 for more details on the methodology for the collection of
Data collected by ITU, see Annex 2 for more details on the methodology for the
Data sourced from Informa Herfindahl-Hirschman Index for mobile cellular (voice and SMS with Li being the number of mobile-cellular subscriptions of firm i,
Data sourced from Informa 159 Measuring the Information Society Report 2014 Box 4. 2: Panel regression models for fixed-broadband and mobile-cellular prices (continued
Data collected by ITU, see www. itu. int/tracker for more information Fixed-broadband speed:
Speed of the entry-level fixed broadband plan in Mbit/s. Data collected by ITU
if the fixed-broadband plan includes a cap in the monthly data allowance, and 0 otherwise.
Data collected by ITU Descriptive statistics of the dependent variables Average Standard deviation Minimum Maximum 2008 2013 2008 2013 2008 2013 2008 2013
Descriptive statistics calculated for 124 economies that have complete data for the two models Source: ITU
Fixed-broadband plans with data caps are linked to prices 31 %lower than unlimited plans Fixed-broadband speed 0. 016
data caps by fixed-broadband service providers is correlated with cheaper entry-level fixed -broadband plans, other things being equal.
ITU data collection considers a minimum of 1 GB Chapter 4. ICT prices and the role of competition
data cap of 1 GB or more, or allows unlimited traffic. In 2013, entry-level fixed-broadband
plans included a data cap in 35 per cent of the countries included in the fixed-broadband
data caps were enforced in entry-level fixed -broadband plans. This suggests that operators can offer cheaper prices in exchange for reduced
data consumption, thus indicating that capacity in fixed-broadband networks is still an issue in several countries,
additional Internet data beyond 1 GB is still non -negligible in many countries Finally, different entry-level fixed-broadband
sector, such as operatorsâ strategies on data caps competition in the fixed-broadband market and the ICT regulatory environment, may together be
is the existence of data caps, which is correlated with lower prices. This may indicate that fixed-broadband capacity
*Data correspond to the GNI per capita (Atlas method) in 2013 or latest available year adjusted with the international inflation rates
GNI p. c. and PPP$ values are based on World bank data Chapter 4. ICT prices and the role of competition
data caps, competition in the fixed -broadband market and the ICT regulatory environment, are together more of
information on the PPP methodology and data, see http://icp. worldbank. org 3 GNI takes into account all production in the domestic economy (i e.
http://data. worldbank. org/indicator/NY. GNP. PCAP. CD 4 Voice over internet services, such as Skype or Voipbuster, are excluded from the analysis in this section
9 Based on 2012 and 2011 ITU data for countries accounting for 97 per cent of global fixed (wired)- broadband subscriptions, it
http://data. worldbank. org/about/country-classifications/country-and-lending-groups 11 The Communications Commission of Kenya (CCK) issued mobile virtual network operator licences to three operators in April
These data suggest there is limited competition among operators 13 Advanced mobile technologies refer to standards agreed by the ITU Radiocommunication Assembly for next-generation
http://www. verizonwireless. com/wcms/consumer/shop/shop-data-plans/more-everything. html 24 The details of the different â 4gâ plans offered by Tigo can be found on the following websites
25 Data for mobile-broadband services have been collected since 2012 through the ITU ICT Price Basket Questionnaire,
http://data. worldbank. org/about/country-classifications/country-and-lending-groups 31 Household incomes include wages, salaries, self employment incomes, capital and property income, private transfers and
35 Differences in the equivalence scales of the source data used in this chapter are corrected roughly using ITU estimates on
World bank Povcalnet data on income distribution are published per capita, and figures from the OECD Database on Income Distribution are equivalized using the square root scale
36 For OECD countries, household income is estimated by multiplying the equivalized household income by the square
According to 2008/9 survey data, mean household consumption in Angola was almost the same as in
38 Data on income distribution are aggregated per household and then equally attributed to each member of the household
In addition, data on income distribution are averaged per decile. If the price of a fixed-broadband plan represents less
In conclusion, the data presented do not determine whether 100 per cent of a population within a country can afford a fixed-broadband plan,
The World Bankâ s Povcalnet data are adjusted using ITU estimates on the average number of inhabitants per household
ITU Tariff Policies Database 2013 (ICTEYE http://www. itu. int/net4/ITU-D/icteye 48 For a detailed description of the ITU ICT Regulatory Tracker, see www. itu. int/tracker
market share data were available. This includes 95 economies from the developing world and 44 from the developed world
economies for which price and market share data were available for 2013. This includes 99 economies from the developing
share data were available. This includes 96 economies from the developing world and 44 from the developed world
Chapter 5. The role of big data for ICT monitoring and for development 5. 1 Introduction
-to date and reliable data, in particular from developing countries. The information and communication technologies (ICT) sector is
of those statistics and identify new data sources In this context, the emergence of big data holds
great promise, and there is an opportunity to explore their use in order to complement the existing, but often limited, ICT data
There is no unique definition of the relatively new phenomenon known as big data. At the most basic level it is understood as being data
sets whose volume, velocity or variety is very high compared to the kinds of datasets that
have traditionally been used. The emergence of big data is linked closely to advances in ICTS. In todayâ s hyper-connected digital world, people
and things leave digital footprints in many different forms and through ever-increasing data flows originating from, among other things
commercial transactions, private and public records that companies and governments collect and store about their clients and citizens, user
Big data have great potential to help produce new and insightful information, and there is a growing debate on how businesses
benefits of big data. Although it was the private sector that first used big data to enhance
efficiency and increase revenues, the practice has expanded to the global statistical community The United nations Statistical commission
NSOS) are looking into ways of using big data sources to complement official statistics and better meet their objectives for providing timely
value added by big data in the context of monitoring of the information society, and Chapter 5. The role of big data for ICT monitoring and for development
174 there is a need to explore its potential as a new data source. While existing data can provide
a relatively accurate picture of the spread of telecommunication networks and services there are significant data gaps
when it comes to understanding the development of the information society. Relatively little information for example, is available on the demand side
While an increasing number of countries currently collect data on the individual use of ICTS, many developing countries do not
produce such information (collected through household surveys or national population and housing censuses) on a regular basis Consequently, not enough data are available
about the types of activity that the Internet is used for, and little is known about the Internet
public services, even fewer data are available to show developments over time and enable informed policy decisions.
society, and that big data have the potential to help realize those efforts In addition to the data produced
and held by telecommunication operators, the broader ICT sector, which includes not just telecommunication companies but also over-the-top (OTT) service
captures a wide array of behavioural data Together, these data sources hold great promise for ICT monitoring,
and this chapter will explore the potential of todayâ s hyper-connected digital world to expand on existing access and
on the role and potential of big data when it comes to providing new insights for broader
Big data are already being leveraged to understand socio -economic well-being, forecast unemployment and analyse societal ties.
Big data from the ICT industry play a particularly important role because they are the only stream of big
data with global socioeconomic coverage. In particular, mobile telephone access is quasi -ubiquitous, and ITU estimates that by the end of
big data hold great promise for development However, while there are a growing number of research collaborations and promising proof
data for development, highlight advances, point to some best practices and identify challenges including in regard to the production and
sharing of big data for development The chapter will first (in Section 5. 2) describe some of the current big data trends and
definitions, highlight the technological developments that have facilitated the emergence of big data, and identify the main
sources and uses of big data, including the use of big data for development and ICT monitoring.
Section 5. 3 will examine the range and type of data that telecommunication companies, in particular mobile-cellular
operators, produce, and how those data are 175 Measuring the Information Society Report 2014 currently being used to track ICT developments
and improve their business. Section 5. 4 looks at the ways in which telecom big
data may be used to complement official ICT statistics and assist in the provision of new evidence for a host of policy domains
while Section 5. 5 discusses the challenges of leveraging big data for ICT monitoring and broader development, including in terms of
standardization and privacy. It will also make some recommendations for mainstreaming and fully exploiting telecom big data for monitoring
and for social and economic development in particular with regard to the different stakeholders involved in the area of big data
from the ICT industry 5. 2 Big data sources, trends and analytics With the origins of the term âoebig dataâ being
shared between academic circles, industry and the media, the term itself is amorphous, with no single definition (Ward and Barker, 2013
Sources of big data Sources Some examples Administrative data â¢Electronic medical records â¢Insurance records â¢Tax records
Commercial transactions â¢Bank transactions (inter-bank as well as personal â¢Credit card transactions â¢Supermarket purchases
increasing amounts of data from different data sources. Indeed, one of the key trends fostering
the emergence of big data is the massive âoedataficationâ and digitization, including of human activity, into digital âoebreadcrumbsâ or
âoefootprintsâ In an increasingly digitized world, big data are generated in digital form from a number of
sources. They include administrative records for example, bank or electronic medical records), commercial transactions between two
Big data is not just about the volume of the data. One of the earliest definitions, introduced
by the Gartner consultancy firm, describes big data characteristics such as velocity and variety, in addition to volume (Laney, 2001
âoevelocityâ refers to the speed at which data are generated, assessed and analysed, while the
Chapter 5. The role of big data for ICT monitoring and for development 176 term âoevarietyâ encompasses the fact that data
can exist as different media (text, audio and video) and come in different formats (structured and unstructured.
The three-Vs definition has caught on and been expanded upon. A fourth V â veracity â was introduced to capture aspects
relating to data quality and provenance, and the uncertainty that may exist in their analysis (IBM
data (Jones, 2012)( Figure 5. 1 Included within the scope of big data is the category of transaction-generated data (TGD), 2
also sometimes described as âoedata exhaustâ or âoetrace dataâ. These are digital records or traces
that have been generated as by-products of doing things (such as processing payments making a phone call and so on) that leave behind
big data is that it is connected directly to human behaviour and its accuracy is generally high
Most of the data captured by telecommunication companies can be classified as TGD As is often the case with technological
The five Vs of big data Source: ITU at the forefront of extracting value from this
data deluge. Encouraged by promising results but also reduced budgets, the public sector is turning towards big data to improve its service
delivery and increase operational efficiency. In addition, there are uses for big data in broader development and monitoring, and there is an
increasing focus on big dataâ s role in producing timely (even real-time) information, as well as new insights that can be used to drive social and
Big data uses by the private and pub -lic sectors Marketing professionals, whose constant aim is to understand their customers,
preferences from the analysis of big data Walmart, the worldâ s biggest retailer, has been one of the largest and earliest users of big data
In 2004, it discovered that the snack food known as Pop Tarts was purchased heavily by United
of data generated through large-scale datafication and digitization of information Different types and forms of
data, including large amounts of unstructured data Level of quality accuracy and uncertainty of data and data
sources VALUE Potential of big data for socio -economic development VELOCITY Speed at which data are
generated and analyzed 177 Measuring the Information Society Report 2014 a specific condition that then led Walmart to
improve its production chain â in this case, by increasing the supply of Pop Tarts to areas likely
to be affected by a disaster. Walmart has also made use of predictive analytics, which uses personal information and purchasing patterns
to extrapolate to a likely future behaviour, and to better target and address customer needs
value of big data Nor is the private sectorâ s use of big data techniques restricted solely to market research
Companies and whole industries (healthcare energy and utilities, transport, etc. are using such techniques to optimize supply chains and
proficient in their use of data-driven decision making have been found to have productivity levels up to 6 per cent higher than firms making
minimal to no use of data for decision-making Brynjolfsson, Hitt and Kim, 2011. Significantly industries now have the ability to conduct
How big data saves energy â Vestas Wind Systems improves turbine performance Vestas, a global energy company dedicated to wind energy, with
installations in over 70 countries, has used big data platforms to improve the modelling of wind energy production and identify
By using big data techniques based on a large set of factors and an extended set of structured and
unstructured data, Vestas was able to significantly improve customer turbine location models and optimize turbine performance
Big data have enabled the creation of a new information environment and allowed the company to manage and analyse
weather and location data in ways that were previously not possible. These new insights have led to improved decisions
big data techniques to understand and control churn, optimize their management of customer relations and manage their network quality and
These fundamental shifts in data exploitation to generate new socioeconomic value coupled with the simultaneous emergence
of new rich data sources that can potentially be linked together and analysed with ease have sparked also the interest of governments
Encouraged by the potential of big data to produce new insights and slimmer budgets governments (at all levels) are now looking to
exploit big data and increase the application of data analytics to a range of activities, including
monitoring and improvement of tax compliance and revenues, crime detection and prediction and improvement of public service delivery
data they collect and generate themselves Chapter 5. The role of big data for ICT monitoring and for development
178 complement their official statistics by leveraging data from new sources, including crowd-sourced data generated by the public
In the United states, for example, Boston City Hall released the mobile app âoestreet Bumpâ which uses a phoneâ s accelerometer to detect
potholes while the app user is driving around Boston and notifies City hall. 3 Some of the
richest data sources for enabling governments and development agencies to improve service delivery are actually external.
data include those captured and/or collected by the private sector, as well as the digital breadcrumbs left behind by citizens as they go
can make use of public and private databases and bigâ data analytics to improve public
administration, from land management to the administration of benefits. The Department of the Treasury has set up a âoedo Not Payâ portal
which links various databases and identifies ineligible recipients to avoid wrong payments and reduce waste and fraud4 (The White house
Big data for development and ICT monitoring One of the richest sources of big data is the data captured by the use of ICTS.
This broadly includes data captured directly by telecommunication operators as well as by Internet companies and by content
providers such as Google, Facebook, Twitter etc. Big data from the ICT services industry are already helping to produce large-scale
development insights of relevance to public policy. Collectively, they can provide rich and potentially real-time insights to a host
of policy domains. It should be noted that in some countries and regions the use of big
data, including big data from the ICT industry is subject to national regulation. In the EU, for
example, a number of directives require data producers to obtain usersâ consent before gathering any of their personal data. 5
data in action and of the great potential of big data for broader development and monitoring
Mayer-Schã nberger and Cukier, 2013; Mcafee and Brynjolfsson, 2012. GFT worked by monitoring health-seeking behaviour expressed
focusing on the use of search-engine data to understand dengue fever outbreaks, 6 monitor prescription drug use (Simmering, Polgreen and
data beyond the realm of user search terms Online job-posting data are being used to
supplement traditional labour statistics in the United States7 and other countries. In another effort, an academic project at MIT known as the
data from hundreds of online retailers. 8 The data are used then by researchers to understand a
UN Global Pulse, a UN initiative to use big data for sustainable development and humanitarian action, has been mining Twitter data from
Indonesia (where Twitter usage is high) 9 to understand food price crises. Global Pulse was
analytics on the Twitter data to forecast the consumer price index several weeks in advance Byrne, 2013. As discussions on the post-2015
Pulse is also using Twitter data to understand and compare the relevance of different development topics among countries (Box 5. 2
Campaign are using big data and visual analytics to identify the most pressing development topics that people around the world
as a source of big data for monitoring purposes Regulators and others are now using the
Qos) data on broadband quality. For example the United states Federal Communications Commission (FCC) has released mobile apps that enable consumers to check their
Chapter 5. The role of big data for ICT monitoring and for development 180 Mobile data
Despite the rapid growth in Internet access 60 per cent of the worldâ s population is still
Depending on the source of Internet data results may also be biased more or less. A 2013 study into the characteristics and behaviour
is very similar to real life (census) data on the average age when American people get
-network big data seems to have the widest socioeconomic coverage in the near term and the greatest potential to produce
Mobile data are already being utilized for research and policy-making, not only in developed but also in developing economies
economic developments. 13 Data are also being used to improve responsiveness in the event of natural disasters or disease outbreaks.
data to understand the spread of malaria in Kenya (Pindolia et al. 2012; Wesolowski et al 2012a), and of cholera in Haiti after the 2010
Mobile network big data have been utilized to great effect in the area of transportation helping to measure and model peopleâ s
data from the ICT sector, and especially those available to telecommunication operators have wide applicability for informing multiple
Leveraging such data to complement official statistics and facilitate broader development will enable governments as well as development agencies to better
big data with a view to understanding its potential for producing additional information and statistics on the information society
better understand the type of data that can be made available 5. 3 Telecommunication data
and their potential for big data analytics Fixed and mobile telecommunication network operators, including Internet service providers
ISPS), are an important source of data and for the purpose of this chapter, all forms of
telecommunication big data (either volume velocity or variety) are being considered. Most telecommunication data can be considered as
TGD, 14 that is, the result of an action undertaken such as making a call, sending an SMS,
accessing the Internet or recharging a prepaid card Since the service with the widest coverage and
-cellular service, data from mobile operators have the greatest potential to produce representative results and reveal developmental insights
Not surprisingly, the big data for development initiatives (outlined in Section 2. 2) have mainly drawn on mobile network big data rather than
on those from fixed-telephone operators or ISPS. Figure 5. 2 illustrates some of the similarities
Telecommunication data The mobile telecommunication data that operators possess can be classified into different types, depending on the nature of the information
they produce. They include traffic data, service access detail records, location and movement data, device characteristics, customer details and
tariff data. For a more detailed overview of these types of data, see Chapter 5 Annex
To collect traffic data, operators use a range of metrics to understand and manage the traffic
flowing through their networks, including the measurement of Internet data volumes, call, SMS and MMS volumes, and value-added service
VAS) volumes. Internet service providers can also use deep packet inspection (DPI), 15 which is a special process for scanning data packages
transiting the network Service access detail records, including call detail records (CDRS), are collected by operators
whenever clients use a service. They are used to manage the infrastructure and for billing purposes, and include information on the
time and duration of services used and the technology used, for example, for the mobile network (2g, 3g, etc..
These data are potentially also very useful for building a rich profile of customers, as outlined in this section
positioning data, with the latter providing more detailed and precise location information Since mobile user devices used to access mobile
Chapter 5. The role of big data for ICT monitoring and for development 182 Figure 5. 2:
An overview of telecom network data Source: ITU, adapted from Naef et al. 2014 Tr affi
Data volume Call volume SMS/MMS volume Erlang DPI data Timestamp of use Contact network
Duration of use Applicable charges Handset type Technology utilized (2g, 3g, DSL/ADSL, etc Billing address
Passive positioning data e g. cell ID IMEI Active positioning data e g. cell triangulation, GPS MAC address
Customer demographics (e g. age, gender, national ID card number Billing address Payment history (postpaid Billing address
Transaction generated data Stored warehouse data 183 Measuring the Information Society Report 2014 subscriberâ s purchasing power (see below for
Finally, operators maintain tariff data in the form of billing records for their current and past
How mobile operators currently use data to track service uptake, business performance and revenues Operators use their TGD to monitor the uptake and penetration
On the basis of the detailed service-usage data collected telecommunication operators can produce a range of detailed
of originating and terminating calls, SMS and MMS usage, data upload volumes, data download volumes, level of use of different
VAS, and level of use of different OTT services. These data can be reported as averages (over time or for different categories of
user), as well as at various levels of aggregation (again over time or for different categories of user.
Finally, service consumption data are used to produce revenue data and projections at various levels of disaggregation or
aggregation. For example, the average revenue per user (ARPU) is a KPI for operators, which identify their most important customers
will, for instance, often associate revenue data with resource allocation to ensure that Qos at the base stations used by their
others, use aggregated revenue data to track and benchmark countriesâ ICT developments, monitor the evolution of the information society and
The telecom industryâ s use of big data Telecommunication companies are actively seeking to intensify their use of big data analytics
in order to improve existing services and create new ones. For operators, big data open up opportunities for better understanding of their
customers, which in turn leads to improved sales and marketing opportunities. At the same time big data can help optimize network operations
and create new revenue streams and business lines, for example when selling data Customer profiling Telecom operators capture a range of
behavioural data about their customers Chapter 5. The role of big data for ICT monitoring and for development
184 Customer profiles include details about customersâ mobility patterns, social networks and consumption preferences. Collectively, these
digital breadcrumbs enable operators to profile and segment their customers based on a variety of metrics (Figure 5. 3). Depending on the country
data regulations governing the manner in which operators may keep and/or use such data.
This being the case, the extent to which behavioural profiling is used by operators may vary greatly
Big data, on the other hand can help to enhance that classification by enabling analysis of the levels of
Big data techniques can help operators understand churn better by enabling them to model the likelihood
Customer profiling using telecom big data Source: ITU CUSTOMER INTERESTS SOCIO -ECONOMIC CLASS LEVEL OF INFLUENCE OF
subsidiary of SK TELECOM, uses big data to help its parent company to cut churn and generate new
revenue, and has used data mining to achieve a fourfold improvement in churn forecasting. The operator found that customers planning to quit
also seek to monetize the data they hold. The simplest way of doing this is to sell (anonymized
data to third parties. The customer insights obtained through the analysis of usage data can also help create new business lines, either
through innovation (e g. new types of VAS) or by partnering with other businesses, including credit-scoring and related financial services
One example is based the US big data startup Cignifi, 19 which obtains data from mobile operators and financial institutions to build credit
profiles and evaluate customer creditworthiness see Box 5. 8). Cross-promotions with brick-and -mortar businesses are a potentially high-growth
5. 4 Big data from mobile telecommunications for development and for better monitoring In 2013, the United nations High-level Panel of
on existing and new sources of data for the post-2015 development agenda (United nations 2013).
a report on âoebig data and modernization of Chapter 5. The role of big data for ICT monitoring and for development
186 statistical systemsâ, and proposed the creation of a big data working group at the global level
UNSC, 2013). 20 Current uses of big data to complement official statistics are still exploratory but there is a growing interest in this topic, as
evidenced by the numerous initiatives being pursued by the United nations, as well as by others, including the World bank, OECD, Paris21
There are many big data sources that can be used to monitor and assess development results. In a world where mobile telephony is
mobile telecommunication big data have unique potential as a new data source, with high mobile -cellular penetration levels and the increasing
use of mobile phones, even among the poorest and most deprived, making them particularly valuable by comparison with other types of
telecommunication data. Indeed, when referring to the data revolution, the United nations High-level Panel cited the example of âoemobile
technology and other advances to enable real -time monitoring of development resultsâ This section will present some of the existing
data in achieving development goals in various policy areas, including disaster management and sustainable and economic development
telecommunication big data have potential as a source to enable monitoring of the information society, although they have yet to assume a critical
potential of big data to complement its existing and often limited, set of ICT statistics. This section
big data could complement existing ICT indicators to provide a more complete comprehensive and up-to-date picture of the
Mobile phone big data for develop -ment Mobile data offer a view of an individualâ s behaviour in a low-cost, high-resolution, real
-time manner. Each time a user interacts with a mobile operator, many details of the interaction
fact that these data are uniquely detailed and tractable, the information captured cannot easily be derived from other sources on such
The fact that the format of the data is relatively similar across different operators and countries creates a huge potential for
potential of mobile data for development in a number of different areas There have been a number of interesting
Big data for disaster management and syndromic surveillance21 Mobility data collected immediately after a disaster can in many cases help emergency
responders to locate affected populations and enable relief agencies to direct aid to the right
One application of such mobility data is for syndromic surveillance, especially to model the spread of vector-borne22 and
Using mobile data for development A recently published report (Cartesian, 2014) explores the potential of mobile data for development.
It points to three primary types of analysis-ex-post evaluation, real-time measurement, and future predictions and planning-in a
mobile data Migration monitoring Text analysis economic downturn prediction Text analysis commodity fluctuation prediction Assessment
Mobile data to track food assistance delivery Geo-targeted links between Ag suppliers /purchasers Pests, bad
positioning data with malaria prevalence data to identify the source and spread of infections Wesolowski et al.
showed how mobile phone data was used to track the spread of cholera after the 2010 earthquake (Bengtsson et al.
The integration of mobility data from mobile networks with geographic information frameworks, 23 supplemented with additional
Chapter 5. The role of big data for ICT monitoring and for development 188 Big data for better transportation planning
A data-centric approach to transportation management is already a reality in many developed economies.
Transportation systems are being fed with sensor data from a multitude of sources such as loop detectors, axle counters
parking occupancy monitors, CCTV, integrated public transport card readers and GPS data derived not only from phones but also from
public transport and private vehicles (Amini Bouillet, Calabrese, Gasparini and Verscheure 2011 One advantage of mobile networks is that
data, which, despite its limited spatial accuracy cell ID), has great potential for transportation planning.
data from mobile operator Orange to map out citizensâ travel routes in Abidjan, the largest city in
and show how data-driven insights could be used to improve the planning and management of transportation services, thereby
Passive mobile positioning data has also been used for transportation planning and management in Estonia (Ahas and Mark, 2005
How mobile network data can track population displacements â an example from the 2010 Haiti earthquake
was produced on the basis of mobile network data to show the potential of big data in tracking population movements
Source: Bengtsson et al. 2011 Figure Box 5. 5: Tracking mobility through mobile phones Port-au-prince (Pap Number of
Leveraging mobile network data for transportation and urban planning in Sri lanka Very similar findings between the results of an official household
results of a big data analysis using mobile-phone data (left-hand map) underscore the merits of big data.
The image on the left based on mobile-phone data, depicts the relative population density in Colombo city and its surrounding regions at 1300
hours on a weekday in 2013, compared to midnight the previous day. While the yellow to red colouring shows areas in which the
Mobile big data (left) versus official survey data (right Both passive and active positioning data are used to
analyse traffic conditions, particularly in urban areas with higher base-station density. Active positioning data (especially GPS) produce higher precision in
location data and are therefore the most useful Operators may offer such specialized services based on passive or active location data) either
directly, or by providing data to third parties. Mobile network data are less expensive, are in real time and
are less time-consuming to produce than survey data, particularly in urban and peri-urban areas
where base-station density tends to be high In another example, the analysis of mobility
flows between two Spanish cities derived from three different data sources â mobile -phone data, geolocated Twitter messages and
the census â showed very similar results, and although the representativeness of the Twitter geolocated data was lower than the (real-time
mobile-phone and census data, the degrees of consistency between the population density profiles and mobility patterns detected by
means of the three datasets were significant Lenormand et al, 2014 Chapter 5. The role of big data for ICT monitoring and for development
190 Big data for socioeconomic analysis Data from mobile operators can provide insights in the areas of economic development and
socioeconomic status, often in near real time Big data techniques can therefore complement official statistics in the intervals between official
surveys, which are usually relatively expensive and time-consuming and therefore carried out infrequently. In many cases, insights derived
from big data sources may help to fill in the gaps rather than replace official surveys. It should
also be noted that mobile network big data are one of the few big data sources (and often the only one) in developing economies that
contain behavioural information on low-income population groups Frias-Martinez et al. 2012) developed a mathematical model to map human mobility
variables derived from mobile network data to peopleâ s socioeconomic and income levels. The model took into account existing socioeconomic
and income-related data derived from official household surveys, and the results showed that populations with higher socioeconomic levels
are associated more strongly with larger mobility ranges than populations from lower socio -economic levels. By extending this method, the
study suggested that it was possible to create a model to estimate income levels based on data
from mobile network operators Another study, by Gutierrez, Krings and Blondel 2013), used two types of mobile network
data, namely subscriber communication data and airtime credit purchase records, to assess socioeconomic and income levels.
The authors used airtime purchase records based Box 5. 7: Poverty mapping in CÃ'te dâ Ivoire using mobile network data
In CÃ'te dâ Ivoire, researchers used mobile network data specifically communication patterns, but also airtime credit
purchase records) from Orange to estimate the relative income of individuals, as well as the diversity and inequality of income
levels. The research helped to understand socioeconomic segregation at a fine-grained level for CÃ'te dâ Ivoire, with the
regarding revenue data and the difficulty this poses for outside parties wishing to obtain such
data The use of mobile-operator TGD can also foster financial inclusion by facilitating the provision of
example of how mobile big data can be used for the unbanked is Cignifi, a big data startup that uses
the mobile phone records of poor people to assess their creditworthiness when they apply for a loan
Big data for understanding societal structures Social-network studies relying on self-reporting relational data typically involve both a limited
number of people and limited number of time points (usually one. As a result, social-network
patterns based on telecommunication data it has become possible to obtain insights into societal structures on a scale that was
Using mobile-phone data to track the creditworthiness of the unbanked Cignifi, a big data startup, has developed an analytic platform
to provide credit and marketing scores for consumers, based on their mobile-phone data. The Cignifi business model is founded
on the idea that âoemobile phone usage is not random â it is highly predictive of an individual consumerâ s lifestyle and riskâ
phone calls, text messages, data usage and, extrapolating from these, lifestyles â the company identifies patterns and uses
respond to changes in customer activity as the data are refreshed usually every two weeks. In addition to updating a personâ s
findings from the model against historical lending data from approximately 40 000 borrowers using the mobile operator Oiâ s
Chapter 5. The role of big data for ICT monitoring and for development 192 have been used to study the geographic
However, telecommunication data are also revolutionizing the study of societal structures at the micro level.
is possible to assess friendship using data from mobile network operators, and that the accuracy is compared high when with self-reported
data. Leveraging these behavioural signatures to obtain an accurate characterization of relationships in the absence of survey data could
also enable the quantification and prediction of macro and micro social-network structures that have thus far been unobservable
Big data to monitor the information society There is a case to be made for analysing data captured by telecommunication operators in
the interests of improving the current range of indicators used for monitoring the information society.
big data analytics. 25 The core indicators on ICT infrastructure and access include indicators on mobile-cellular and
mobile-broadband subscription data is that they do not refer to unique subscriptions, or mobile
to monitor the time during which a SIM CARD remains inactive. In June 2014, for example GSMA estimated that, globally, the number of
Survey-based data, for example on Internet users and mobile-phone users, do not entail the same issues as subscription data.
They are collected through household surveys, directly from citizens, and their level of reliability is relatively
Survey-based data can also be broken down by individual characteristics, including gender, age, educational level and occupation
these data is that they are not widely available in particular, many developing countries do not yet collect data on individual use of
or household access to, ICTS), are relatively expensive to produce, and are much less timely
than subscription data (often with a time lag of one year. Consequently, data on users of
the information society and the types of online service they consume are limited, and in many
networks and mobile big data could be used to identify alternative, less costly and faster ways of
administrative data from operators and survey data collected by NSOS, it is particularly interesting to assess some of the ways in
which big data can be used to overcome the shortcomings of existing key ICT indicators and to provide additional insights into ICT access
and use, user behaviour, activities and also the individual user. Big data could help in obtaining
more granular information in several areas, and big data techniques could be applied to existing data to produce new insights.
In particular operatorsâ big data could produce information in the following areas Individual subscriber characteristics: Additional categorization across both time and space are
possible for subscription indicators, and big data could provide additional information on gender, socioeconomic status and user location
Information on gender or age, for example, could be derived from customer registration information notwithstanding a number of challenges and
privacy issues, as discussed later in Section 5. 5 The socioeconomic status of the person linked
data techniques applied to usersâ consumption information, as well as other data contained in customer registration information.
In addition, the analysis of customersâ mobility patterns will often allow for an understanding of important locations
All subscription data could provide information as to location. In the case of fixed-telephone and fixed-broadband
to other (administrative) databases in order to Box 5. 9: Using mobile big data and mobile networks for implementing surveys
An important measurement for assessing the development of the information society is the extent to which households
surveys for collecting the corresponding data, and the declining response rates where traditional surveys are concerned (Groves
collection of survey data. This could include targeting a wide variety of respondents covering the full spectrum of appropriate
using big data analytics. In 2011, for example, UN Global Pulse partnered with Jana, 27 a mobile-technology company,
Chapter 5. The role of big data for ICT monitoring and for development 194 create new information.
Particularly rich possibilities exist where data from mobile-cellular and mobile-broadband subscriptions are concerned, since they are
voice, data and VAS â over time, and the intensity of use. Mobile operators are able to provide
In addition, mobile-operator data could be combined with customer information from popular online services, such as Facebook
the profiles developed using data from online services with customer profiles generated from analyses of mobile-operator data.
This would require telecommunication operators, OTT providers and other Internet content providers to work together and share information
By applying big data techniques to survey data and administrative data from operators, new insights could be derived, in particular, in respect
of the following Subscriptions versus subscribers: Big data techniques could help extrapolate the actual number of unique mobile subscribers or users
rather than just subscriptions, by comparing subscription numbers to user numbers derived from household surveys,
account usage patterns or data from popular Internet companies such as Google or Facebook By linking data collected from different sources
and combining subscription data and usage patterns, a correlation algorithm could be developed to reverse engineer approximate
values for these indicators, in order to estimate user numbers in between surveys, and possibly in real time. This could be pursued in a similar
variables using mobile-phone usage data, as described in greater detail at the beginning of this section. It is important to note here that
data methods only complement existing surveys rather than replacing them completely (see Section 5. 5 for a further discussion of this
In sum, relatively simple big data techniques can help analyse and provide complementary information on existing ICT data, and provide
new insights into the measurement of the information society. This includes information on the use of different services and applications
SIM usage and the fact that users will in many cases be using ICT services from more than
combining data from surveys with big data to build new correlation and predictive analytic techniques Finally, it should be noted that the methods that
and for other big data for development projects, big data analysis cannot replace survey data, which is needed to build and
test correlations and to validate big data results While the opportunities discussed above present what is analytically possible, data access
and privacy considerations are complex and nuanced, and therefore place constraints on what is practically feasible or advisable.
Section 5. 5 discusses these challenges in greater detail 5. 5 Challenges and the way
forward Attempting to extract value from an exponentially growing data deluge of varying structure and variety comes with its share of
challenges. The most pressing concerns are those associated with the standardization and interoperability of big data analytics, as
well as with privacy and security. Addressing such privacy and other concerns with respect to data sharing and use is critical,
and it is important for big data producers and users to collaborate closely in that regard. This includes
raising awareness about the importance and potential of producing new insights, and the establishment of public-private partnerships
to exploit fully the potential of big data for development Data curation, standardization and continuity Data curation and data preparation help to
structure, archive, document and preserve data in a framework that will facilitate human understanding and decision-making.
Traditional curation approaches do not scale with big data and require automation, especially since 85 per cent of big data are estimated to be
unstructured (Techamerica Foundation, 2012 Dealing with large heterogeneous data sets calls for algorithms that can understand the data
shape while also providing analysts with some understanding of what the curation is doing to
the data (Weber, Palmer and Chao, 2012 Telecom network operators themselves have to contend with interoperability issues arising
from the different systems (often from different vendors) they employ. It is not uncommon for
operators to write customized mediation software to overcome potential inter-comparability issues among data from different systems.
The problems are compounded when one has to take account of secondary third-party users that may seek to
leverage the data. The framework used by an NSO to organize data would be different from that
used by a network engineer or a marketing or business intelligence specialist. Naturally, telecom network operators have curated their data
based on their needs. To be able to use telecom big data for development and monitoring and to guarantee its continuity, the creation of
a semantic framework would require greater consensus among the many diverse stakeholders involved (telecom operators, network equipment
manufacturers, system developers, developmental practitioners and researchers, NSOS, etc Chapter 5. The role of big data for ICT monitoring and for development
196 Accessing and storing data, and data philanthropy Big data for development is still in its nascent
stages and, as such, comes with its share of challenges, not least of which is obtaining access to what is essentially private data.
Private corporations would hesitate to share information on their clients and their business processes in case such sharing is illegal, precipitates a
loss of user confidence and/or accidentally reveals competitive business processes. More importantly, companies will not share until there
Until holders of big data become more comfortable about their release it is going to be difficult for third-party research
big data, but it has taken them considerable time to build and leverage the necessary relationships with operators.
the necessary parameters as to how data are to be used, including the manner in which they
challenges associated with extraction of the data on account of different curation approaches and problems relating to the interoperability of
towards sharing data more publicly. Orange for example, hosted a âoedata for Development Challenge, â releasing an aggregated anonymized
using Orange data from Senegal, is planned for 2015.30 In 2014, Telecom italia initiated a similar challenge, making data from the territories of
Milan and the Autonomous Province of Trento available to researchers for analysis. 31 It has
partnered with other data providers to curate and release additional big datasets containing weather, public and private transport, energy
event and social network data. In both the Orange and Telecom italia cases, researchers had to go through an approval process in order to
regular and safe sharing of data by building on the precedents being created by the ad hoc
the efforts being made to open up private-data stores in order to obtain actionable development insights There is a gap that needs to be addressed if
large-scale pooling and sharing of such data are to become a reality. Cross-sector and cross
and data-curation practices when pooling data from multiple sources. This facilitatory role may even be played by a third-party organization
faced by data providers, effectively acting as a gatekeeper to ensure that data are used transparently and in a way that contributes to
overall scientific knowledge generation, while ensuring that any safeguards that may be applicable in respect of private information are
trial data. To facilitate the process, they hired Yale Universityâ s Open Data Access (YODA) Project
197 Measuring the Information Society Report 2014 to act as gatekeepers (Krumholz, 2014. YODA undertakes the necessary scientific review of any
make use of the data and ensures that necessary privacy and data usage guidelines are followed. 32
The question remains as to who is placed best to act as gatekeepers and standard-bearers when it comes to telecom network big data
Some have argued that NSOS are placed well to ensure that best practices are followed in the collection and representation of big data
and to provide a stamp of trust for potential third-party data seekers. Telecom operators for their part, are regulated mostly by sector
-specific regulators who can also have purview and dictate terms governing the privacy and data-reporting responsibilities of operators
Ultimately, however, the decision as to who takes on the gatekeeper and standardization function requires the confluence of multiple
to play in building an institutional model for data sharing and collaboration, in consultation with all
protocols) of privately held data such as mobile -phone records can be mutually beneficial to both government and private sector.
-action system shows how data sharing could be considered a business risk mitigation strategy for operators in emerging markets.
big data is linked closely to advances in the ICT sphere, including the falling cost of data storage
Depending on the data volume, storage can still be costly, especially where privacy considerations preempt the use of specialized third-party
cloud-based services. But as storage prices continue to fall, they are expected to be less of
As social scientists look towards private data sources, privacy and security concerns become paramount. To mitigate the potential risks, all
such data sharing. These stakeholders include not just the public and private sectors, but also significantly, the general public, who in many
cases are the primary producers of such data through their activities. It is also the public that
must ultimately decide on how the data they produce may be used. The World Economic Forumâ s âoerethinking Personal Dataâ project
new data ecosystem. 33 Discussions must address the individualâ s privacy expectations, as well as those of private-sector stakeholders looking
identifiable individual (data subject) â (OECD 2013). ) The result of such an approach has been the policy of âoeinform and consentâ practised by
most companies to inform users of what data are Chapter 5. The role of big data for ICT monitoring and for development
198 being collected and how they will be used. It has been argued, however, that in a big data world
the âoeinform and consentâ approach is woefully inadequate and impractical, and that a new approach is needed (Mayer-Schã nberger and
data paradigm, the greatest potential often lies in secondary uses, which may well manifest long after the data was collected originally.
It is thus impractical for companies to have a priori knowledge of all the potential uses and to seek
Given the volumes of data that individuals are now generating, companies would find themselves struggling to maintain
privacy issues that may arise when data from one source is combined with data from other sources
to reveal/infer new data and insights. This blurs the lines between personal and nonpersonal
information, allowing seemingly nonpersonal data to be linked to an actual individual (Ohm 2010). ) Digitized behavioural data crumbs may in
fact greatly diminish personal privacy. The use of DPI, for example can technically reveal all of a userâ s online activity.
Going one step further it is possible to understand a personâ s needs behaviours and preferences by using data
-mining techniques on the digital breadcrumbs For instance, a recent study showed how Facebook âoelikesâ could accurately predict a range
Data anonymization35 (i e. methods designed to strip data of personal information), employed by computational social scientists, has been
called into question (Narayanan and Shmatikov 2008). ) A recent study of mobile CDRS for 1. 5
out that the data could in fact be de-anonymized completely by cross-referencing them with other
data sources. The attendant privacy concerns about such cross-referencing are clear, and have to be taken seriously and addressed
on the country, SIM resellers may pre-register the SIMS they sell under their own name, and
SIMS that are registered by one family member may be used by other members of the same family.
Sri lankan operators, for example, see a great mismatch between the person registering a subscription and the person using it.
data are kept secure. Data breaches undermine consumer confidence and hinder efforts to exploit big data for the greater social good.
Encryption virtual private networks (VPNS), firewalls, threat monitoring and auditing are some potential technical solutions that are employed currently
199 Measuring the Information Society Report 2014 but they need to be mainstreamed (Adolph 2013).
of big data, the World Economic Forum (WEF initiated a global multi-stakeholder dialogue on personal data that advocated a principle-based
approach that shifts governance from the data per se to its use; acknowledges the importance of context rather that treating privacy as a binary
privacy and data protection in a big data world the danger is that these questions may take too
use of big data for broader development. Hence a balanced risk-based approach may be required in the context of what is under discussion here
i e. the use of telecom big data for monitoring and development. This does still require the
of big data for development can be âoesandboxedâ with appropriate privacy protections imposed on researchers, while still ensuring that the broader
input data. In the big data paradigm, it is easy to overlook that concept, given the expectation
that when dealing with vast volumes of (often unstructured) data from a multitude of sources
âoemessinessâ is to be expected. As Mayer -Schã nberger and Cukier (2013) note, âoewhat we lose in accuracy at the micro level we gain
Data quality and their provenance do matter, and the question is important in establishing the
generalizability of the big data findings Data provenance and data cleaning Understanding data provenance involves tracing the pathways taken by data from the
originating source through all the processes that may have mutated, replicated or combined the data that feed into the big data analyses.
This is no simple feat. Nor, given the varied sources of data that are utilized, is it always as feasible as
the scientific community would wish. However at the very least it is important to understand
some aspects of the origin of data. For example the fact that some mobile network operators choose to include the complete routing of a call
that has been forwarded means that there may be multiple records in the CDRS for the same
call. If that is not taken into consideration, the subsequent social network analysis could contain errors (overstating or understating tie strength
establish data provenance as envisaged by scientists, it is at the very least important to
have created the data Data cleaning remains a key part of the process to ensure data quality.
It is important to verify that the quantitative and qualitative (i e categorical) variables have been recorded as
expected. In a subsequent step, outliers must be removed, using decision-tree algorithms or other techniques.
However, data cleaning itself is a subjective process (for example, one has to decide which variables to consider) and not a
Are the data representative Related to the question of data provenance is the issue of understanding the underlying
Chapter 5. The role of big data for ICT monitoring and for development 200 population whose behaviour has been captured
The large data sizes may make the sampling rate irrelevant, but they do not necessarily
make it representative. Not everyone uses Twitter, Facebook or Google. For example ITU estimates suggest that 40 per cent of the
considering how telecommunication data may be used for monitoring and development While the potential benefits to be gained from
leveraging mobile network operator data for monitoring and development purposes hinges on the large coverage, close to the actual
data does not automatically mean that issues such as measurement bias and methodology internal and external data validity and data
interdependencies can be ignored. These are fundamental issues not just for âoesmall dataâ but also for âoebig dataâ (Boyd and Crawford, 2012
personas, so studying peopleâ s data exhaust may not always give us insights into real-world dynamics.
where in essence the data artefact is itself a by-product of another activity. Telecom network big data, which mostly fall under this category
may be less susceptible to self-censorship and persona development, but the possibility of these phenomena cannot be ruled out.
In a way, big data analyses of behavioural data are subject to a form of the Heisenberg uncertainty principle, whereby
as soon as the basic process of an analysis is known, there may be concerted efforts to exhibit different behaviour and/or actions to change
considering big data analyses for monitoring purposes. Dr Nathan Eagle, a pioneer in the use of cellphone records to understand phenomena
when CDR data from Rwanda showed low mobility in the wake of flooding he theorized that this was due to an outbreak
-data analyses based on big data. For example prior research had established a power-law distribution between the frequency of airtime
the big data paradigm, leading to the discovery of misleading patterns. As Googleâ s Chief Economist, Hal Varian, notes, âoethere are often
) Big data draws many of its techniques from machine learning, which is primarily about correlation and predictions. 40 Big data are by
their very nature observational and can measure only correlation and not causality. Supporters of big data have predicted the end of theory and
hypothesis-testing, with correlation trumping causality as the most relevant method (Anderson 2008; Mayer-Schã nberger and Cukier, 2013
mining of big data sources) will not drown out traditional deductive science (i e. hypothesis testing), even in a big data paradigm.
Among the three Vs in the traditional big data definition volume and variety produce countervailing forces.
More volume makes big data induction techniques easier and more effective, while more variety makes them harder and less effective
It is this variety issue that will ensure the need for explaining behaviour (i e. deductive science
Chapter 5. The role of big data for ICT monitoring and for development 202 rather than merely predicting it (Mullainathan
Causal modelling is possible in a big data paradigm by conducting experiments. Telecom network operators themselves use such
with âoesmall data, â in this case the statistics collected by the Centers for Disease Control and
note, when combined with small data, âoegreater value can be obtained by combining GFT with other nearâ real time health data. â Where data
from mobile network operators are used for syndromic surveillance, as in the case of malaria in Kenya (Wesolowski et al.
2012a), big data are most useful as a basis for encouraging timely investigation, rather than as a replacement
how telecommunication network data could be used for monitoring, surveys and supplemental datasets will remain important to sharpen the
prior to data anonymization to build a training dataset. This enabled them, for example, to understand variations in mobility, social networks
Virseda (2012) needed census data to build their algorithms and provide training data for their
algorithms to reverse engineer approximate survey maps. Official statistics will thus continue to be important to building the big data models
and for periodic benchmarking so that the models can be tuned fine to reflect ground realities Transparency and replicability
and replicability problems with big data research The fact that the original private data may in many cases not be available to everyone
underscores the importance of opening up such private-data sources (in a manner that addresses
potential privacy concerns) or of peer reviews that can hone and improve the analyses. Instead consumers of such research have no option but
data calls for a combination of specialized skills in the areas of data mining, statistics and
domain expertise, as well as data preparation cleaning and visualization. NSOS may have deep statistical skills in house, but this is not
enough when it comes to working with large volumes of big data calling for computer science and decision-analysis skills that are not
emphasized in traditional statistical courses Mcafee and Brynjolfsson, 2012. NSOS recognize this shortcoming. In a recent global survey of
data sources (UNSC, 2013. Currently, there is a mismatch between the supply of and demand for talented individuals with the requisite
i e. data scientists. Mckinsey predicts that by 2018 the demand for data-savvy managers and analysts in the United states will
amount to 450 000, whereas the supply will fall far short of this, at only 160 000 (Manyika et al
to leverage big data for development will face competition from the private sector when seeking to attract the right talent.
big data to complement official statistics, have a shortage of advanced analytical skills by comparison with developed economies.
Current research suggests that new big data sources have great potential to complement official statistics and produce insightful
of big data will have to overcome a number of barriers. This includes the development of models which protect user privacy while still
for using big data to complement official ICT statistics. Although this report highlights some of the big data sources and techniques that
could be used, further research is needed to understand and confirm the usefulness of big data sources for monitoring the information
society As with other official statistics, it is paramount for big data producers and big data users
to collaborate and to initiate a dialogue to identify opportunities and understand needs and constraints.
Since many of the big data sources lie within the private sector, close cooperation between NSOS, on the one hand
talk to commercial vendors of big data analytics In addition, operators and Internet companies can benefit greatly from engagement with
to leverage big data for different purposes Such engagement will also broaden their understanding of the limitations and assist them
the applications of data use for development are concerned, operators also have an interest in maximizing the economic well-being of their
Chapter 5. The role of big data for ICT monitoring and for development 204 privacy framework, in consultation with other
share their data with those from other sources including from competitors), but this is something that is worth exploring.
big data sources has great potential to increase added value and produce new insights. There is scope for exploring established models for such
some of their customer data with credit bureaux Governments Governments have different opportunities and different roles to play in the exploitation of big
data for monitoring and development. They can use big data to identify areas where rapid intervention may be necessary,
to track progress and make sure their decisions are based evidence and to strengthen accountability. More and more governments are recognizing the importance of
big data and have set up communities of practice and working groups to study their use and potential impact (UNSC 2013
setting big data standards. To this end, national regulatory authorities (NRAS) and NSOS, in consultation with other national stakeholders
big data clearing houses that promote analytical best practices in relation to the use of big data for complementing official statistics and for
development. Those standards, which NSOS are in the best position to enforce, would also have
to encompass best practices in relation to data curation and metadata standards. To this end NSOS must also prioritize the upgrading of the
handle big data, while at the same time investing in the necessary computational infrastructure As the main regulatory interface to the telecom
big data may be leveraged for social good Regulators have a role to play in facilitating the introduction of legislation that addresses privacy
concerns while encouraging data sharing in a secure manner. The following recommendations were made in a recently published ITU draft
In return, data users would be permitted to reuse personal data for novel purposes where a privacy assessment indicates
While the use of big data can help better decision-making through probabilistic predictions, this information should not be used against citizens
can utilize big data predictions â¢Fostering big data competition and openness: Regulators could foster big data competition in increasingly
concentrated big data markets, including by ensuring that data holders allow others to access their data under fair and
reasonable terms 205 Measuring the Information Society Report 2014 International stakeholders International stakeholders â including UN
agencies and initiatives (such as ITU and UN Global Pulse), the Partnership on Measuring ICT for Development, ICT industry associations
and producers of big data (Google, Facebook etc.)) â have an important role globally. More work is needed to understand fully the potential
of big data and examine the challenges and opportunities related to big data in the ICT sector. To this end, the key international
stakeholders have to work together to facilitate the global discussion on the use of big data
UN Global Pulse, as one of the main UN initiatives exploring the use of big data,
can do much to inform and motivate the discussion on global best practices and the use of big data for
development Where using big data for monitoring the information society is concerned, new partnerships, including public-private
partnerships between data providers and the ICT statistical community, including ITU, could be formed to explore new opportunities and
address challenges, including in the area of international data comparability and standards As one of the main international bodies working
on issues related to the telecommunication sector, ITU could leverage its position to facilitate global discussion on the use of telecom big data
for monitoring the information society Together, ITU and UN Global Pulse could facilitate the work that needs to be done by
NRAS and NSOS, through awareness raising and engagement on privacy frameworks, data sharing, and analytical global best practices
ITU could help reduce the transaction costs associated with obtaining telecommunication big data, for example by facilitating the
standards-setting process. Standardized contracts for obtaining data access as well as standards on how the data are stored,
collated and curated can collectively reduce the overall transaction costs of accessing and leveraging telecommunication big data for social good
Academia, research institutes and develop -ment practitioners The research into how telecom data may be used to aid broader development is being done
mainly by academia, public and private research institutes and, to a lesser degree, development practitioners. This makes them important
stakeholders in defining the state of the art with respect to leveraging big data for development They, more than others, have been the first to
engage with telecommunication operators with a view to using their data for development They therefore understand the potential and
challenges from multiple perspectives. Their collective experiences will be valuable as big data for development becomes mainstreamed
207 Measuring the Information Society Report 2014 Chapter 5 Annex The mobile-telecommunication data that
operators possess can be classified into different types, depending on the nature of the information they produce.
They include traffic data, service access detail records, location and movement data, device characteristics, customer
details and tariff data Traffic data Operators use a range of metrics to understand and manage the traffic flowing through their
networks. These include â¢Data volume: both uplink and downlink volumes for Internet traffic can be captured at various levels of
disaggregation down to the individual subscriber, or even to the level of a base station (in the case of a mobile operator
or local switch (in PSTN networks. These can be analysed to understand subscriber demand for data at both an individual
level and at aggregate levels, and the understanding thus gained can be used for billing purposes and for network
management â¢Erlang: a dimensionless metric used by mobile network operators to understand the offered and utilized network load. 41
Erlang data are used to understand the load on a base station at any given time
the entire data stream to be captured and stored, owing to the storage requirements that would be needed and also to
The most common use of such data is for basic billing purposes, in addition to which they can
Chapter 5. The role of big data for ICT monitoring and for development 208 Location and movement data
Mobile networks can, depending on their sophistication, capture a range of movement and location variables, which can be broadly
active positioning data (see Annex Box 1 Annex Box 1: Active versus passive positioning data
Passive positioning transaction generated data (TGD) is automatically generated by the network and captured in the
operatorâ s logs for billing and network management purposes to understand network load and to keep track of the handset in
Active positioning data (which is of relevance only to mobile networks) is location and movement data that is captured in response to a specially initiated network
query to locate a handset using either network or handset-based positioning methods. GPS location data can also be considered
as active positioning data Active positioning data Active positioning data can be generated using either device
-centric or network-centric methods, as well as via satellite i e. GPS. The use of these methods has developed either in
response to national regulations requiring operators to capture higher-precision location data, and/or to a business case for
providing location-based services. The large-scale capture of such higher-resolution data is undertaken mainly by operators
in developed economies. Operators in developing economies use some of these methods, but often on a case-by-case basis
and not for their entire subscriber base. 43 However, this trend is currently changing, and an increasing number of regulators are
Passive positioning data Passive location data are contained in the subscriberâ s registration information, which includes some form of mailing address
billing address in the case of postpaid. While fixed-telephone network operators have access only to static location data
mobile networks have much richer and dynamic location data CDRS, SMS detail records and Internet access records are the
main sources of passive positioning data for mobile operators and reside in their data warehouses. These records include the
ID of the antenna (cell ID), which in turn has a geolocation an azimuth (i e. antenna orientation information) and an
angular tilt. 45 It is also possible to obtain such data in real time through data-mediation services,
but these are not universally implemented Passive location data from the billing records are obviously
sparse and generated only when the phone is used and when the network knows which cell a particular handset is currently
connected to. However, many operators choose not to archive these data if they do not have a business case to justify the
additional storage costs. Where they are archived, such cell -handoff data provide a time-stamped sequence of cells that the
phone was attached to, and provides for a rich mobility profile as compared to the event-based billing records
Passive positioning data based on cell IDS is inexpensive when compared to active positioning data, but the tradeoff
lies in their lower precision, usually at the level of network cells. This lower-resolution location estimate can range from
such passive location data, at an aggregate level (temporal and/or spatial) these data remain very valuable
Source: ITU Device characteristics All mobile user devices used to access mobile telecommunication services come with an
The Aoa method uses data from base stations that have been augmented with arrays of smart antennas.
Such location data have high spatial resolution, but are costly for operators to implement Enhanced observed time
items of demographic data during the customer registration process. These can include the customerâ s age, gender, billing address and
Tariff data Operators maintain the complete tariff sheet and billing records for their current and past services
Mobile operators can associate such data with traffic data to understand the revenue that is being generated by specific network elements
e g. base stations), not only retrospectively, but also, possibly, in real time Chapter 5. The role of big data for ICT monitoring and for development
210 1 The report of the UN Secretary-Generalâ s High-level Panel of Eminent Persons on the post-2015 Development Agenda
calls for a data revolution that âoewould draw on existing and new sources of data to fully integrate statistics into decision-making
and use of, data and ensure increased support for statistical systems. â This suggests that efforts to
and complement, official statistics have turned to the search for new data sources, including big data. In addition, the European Statistical System Committee (ESSC) in 2013 adopted the Scheveningen Memorandum
on  Big data and Official Statisticsâ, which acknowledges that Big data represents new opportunities and challenges for
Official Statistics, and which encourages the European Statistical System and its partners to effectively examine the potential
of Big data sources, see: http://epp. eurostat. ec. europa. eu/portal/page/portal/pgp ess/0 docs/estat/SCHEVENINGEN
MEMORANDUM%20final%20version. pdf 2 This term was discussed first in 1991, although the term then used was generated âoetransaction informationâ (Mcmanus, 1990
on the protection of individuals with regard to the processing of personal data and on the free movement of such data
to which personal data cannot legitimately be processed without the consent of the data subject, except if necessary to
of the data subject, who should be given clear and comprehensive information as to the manner and purpose of such
http://policyreview. info/articles/analysis/big data-big-responsibilities 6 See https://www. google. org/denguetrends
/7 A good example of this is the Conference Board Help Wanted Online (HWOL) data series that measures the number of new
http://www. conference-board. org/data/helpwantedonline. cfm 8 See http://bpp. mit. edu /9 According to Peerreach. com, 20 per cent of Indonesiaâ s online population uses Twitter, the second highest ratio in the world
/11 ITU World Telecommunication/ICT Indicators database, 17th edition, 2014, available at http://www. itu. int/en/ITU-D/Statistics/Pages/publications/wtid. aspx
15 Deep packet inspection (DPI) is a process that utilizes specialized software to scan all of the data packets traversing a
and analysis of health data about a clinical syndrome that has a significant impact on public health, with the data in question being used to drive decisions about health policy and health education
22 A vector-borne disease is a disease that is transmitted through an agent (person, animal or microorganism
23 A geographic information framework is a representation framework that codes the components of geospatial data in a
standardized manner so as to facilitate analyses and data exchange 24 See http://www. itu. int/en/ITU-D/Statistics/Pages/intlcoop/partnership/default. aspx for more information regarding the
30 More information about the Data for Development (D4d) challenges using Orange data can be found at
32 More information about the Yale university Open Data (YODA) project can be found at http://medicine. yale. edu/core/projects/yodap/index. aspx
42 DPI is a process that utilizes specialized software to scan all of the data packets traversing a particular IP network.
Chapter 5. The role of big data for ICT monitoring and for development 212 45 Most network operators use multiple sectorized antennas on a single base station.
47 An international mobile subscriber identity (IMSI) number is a 15-digit number unique to the particular SIM in a subscriberâ s
the data deluge makes the scientific method obsolete. Wired Magazine, 16 (7 Askitas, N. and Zimmermann, K. F. 2009), Google econometrics and unemployment forecasting.
Azman, A s.,Urquhart, E. A.,Zaitchik, B. and Lessler, J. 2013), Using Mobile phone Data to Supercharge Epidemic Models of
Data for Development: Net Mobi 2013 Bengtsson, L.,Lu, X.,Thorson, A.,Garfield, R. and von Schreeb, J. 2011), Improved response to disasters and outbreaks by
tracking population movements with mobile phone network data: a post-earthquake geospatial study in Haiti. PLOS
mobility and optimizing public transport using cellphone data. Data for Development: Net Mobi 2013 Blumenstock, J. E. 2011), Using mobile phone data to measure the ties between nations.
Proceedings of the 2011 iconference pp. 195â 202. New york, USA: ACM Press. doi: 10.1145/1940761.1940788
Blumenstock, J. E. 2012), Inferring patterns of internal migration from mobile phone call records: evidence from Rwanda
Bollier, D. 2010), The Promise and Peril of Big data. WASHINGTON DC Bouckaert, J.,Van dijk, T. and Verboven, F. 2010), Access regulation, competition, and broadband penetration:
Boyd, D. and Crawford, K. 2012), Critical questions for big data. Information, Communication & Society, 15 (5), 662â 679. doi:
How Does driven Data Decisionmaking Affect Firm Performance? Retrieved from: http://papers. ssrn. com/sol3/papers. cfm?
Byrne, C. 2013), How The UNÂ s New Data Lab in Indonesia Uses Twitter to Preempt Disaster, Fast Company.
http://www. fastcolabs. com/3007178/open-company/how-uns-new-data-lab-indonesia-uses-twitter
Caceres, N.,Wideberg, J. P. and Benitez, F. G. 2007), Deriving originâ destination data from a mobile phone network.
Calabrese, F.,Di Lorenzo, G.,Liu, L. and Ratti, C. 2011), Estimating origin-destination flows using mobile phone location data
Global Mobile Data Traffic Forecast Update, 2013-2018, White paper, February 2014 Available at: http://www. cisco. com/c/en/us/solutions/collateral/service-provider/visual-networking-index-vni/white
David, T. 2013), Big data from Cheap Phones. Technology Review, 116 (3), 50â 54 De Montjoye, Y.-A.,Hidalgo, C a, Verleysen, M. and Blondel, V. D. 2013), Unique in the Crowd:
D. 2009), Inferring friendship network structure by using mobile phone data. Proceedings of the National Academy of Sciences of the United states of america, 106 (36), 15274â 8. doi:
using search engine query data. Nature, 457 (7232), 1012â 4. doi: 10.1038/nature07634 Groves, R. M. 2011), A possible data future for the observational social sciences.
COSSA 30th Anniversary meeting Washington, D c Gruber, H. and Koutroumpis, P. 2013), Competition enhancing regulation and diffusion of innovation:
GSMA (2013b), The Mandatory Registration of Prepaid SIM CARD Users, White paper. Retrieved from: http://www. gsma
com/publicpolicy/wp-content/uploads/2013/11/GSMA WHITE-Paper mandatory-Registration-of-Prepaid-SIM -Users 32pgwebv3. pdf References
Harford, T. 2014), Big data: are we making a big mistake? Financial times. pp. 7â 11. Retrieved 28 march 2014 from
The real-world use of big data. How innovative enterprises extract value from uncertain data. Retrieved from:
http://public. dhe. ibm. com/common/ssi/ecm/en/gbe03519usen/GBE03519USEN. PDF IBM (2013), The Four Vâ s of Big data.
Retrieved from http://www. ibmbigdatahub. com/sites/default/files/infographic file/4-Vs-of-big data. jpg
ICT Qatar (2014), Qatarâ s ICT landscape 2014 households and individuals. Available at: http://www. ictqatar. qa/en/documents
ITU (2011), Handbook for the collection of administrative data on Telecommunications/ICT. Available at http://www. itu. int/en/ITU-D/Statistics/Pages/publications/handbook. aspx
ITU (2013), Big data: Big today, normal tomorrow. Retrieved from: http://www. itu. int/oth/T2301000022/en
ITU GSR discussion paper (2014), Big data-Opportunity or Threat. Retrieved from: http://www. itu. int/en/ITU-D/Conferences
Krumholz, H. M. 2014), Give the data to the People, New york times. Retrieved 2 february 2014 from
http://www. nytimes. com/2014/02/03/opinion/give-the-data-to-the-people. html? hp&rref=opinion& r=1
Laney, D. 2001), 3d data management: Controlling data volume, velocity and variety. Gartner, 6 Lazer, D.,Kennedy, R.,King, G. and Vespignani, A. 2014), Big data.
The parable of Google Flu: traps in big data analysis. Science New york, N y.),343 (6176), 1203â 5. doi:
10.1126/science. 1248506 Lazer, D.,Pentland, A.,Adamic, L.,Aral, S.,Barabasi, A l.,Brewer, D.,Van alstyne, M. 2009), Computational social science
Science (New york, N y.),323 (5915), 721â 3. doi: 10.1126/science. 1167742 Lenormand, M, Picornell, M.,Cantu-Ros, O.,Tugores, A.,Louail, T.,Herranz, R.,Barthelemy, M.,Frias-Martinez, E.,Ramasco, J
Lokanathan, S.,Silva, N. de, Kreindler, G.,Miyauchi, Y. and Dhananjaya, D. 2014), Using mobile network big data for informing
Manyika, J.,Chui, M.,Brown, B.,Bughin, J.,Dobbs, R.,Roxburgh, C. and Byers, A. H. 2011), Big data:
Data for Development: Net Mobi 2013 Mayer-Schã nberger, V. and Cukier, K. 2013), Big data:
A revolution that will transform how we live, work, and think. Houghton Mifflin Harcourt Mcafee, A. and Brynjolfsson, E. 2012), Big data:
the management revolution. Harvard Business Review, 90 (10), 60â 6, 68,128 Retrieved from: http://www. ncbi. nlm. nih. gov/pubmed/23074865
What big data means for social science. Headcon â 13 Part I. Retrieved from http://edge. org/panel/headcon-13-part-i
Naef, E.,Muelbert, P.,Raza, S.,Frederick, R.,Kendall, J. and Gupta, N. 2014), Using mobile data for development.
Pindolia, D. K.,Garcia, A j.,Wesolowski, A.,Smith, D. L.,Buckee, C. O.,Noor, A m.,Tatem, A j. 2012), Human movement data
2012), Demystifying Big data: A Practical Guide To Transforming The Business of Government Retrieved from: http://www. techamerica. org/Docs/filemanager. cfm?
The White house (2014), Big data: Seizing Opportunities, Preserving values. Executive Office of the President. Retrieved from
Using New Data to Understand Emerging Vulnerability in Real-time UN High-level Panel (2013), A new global Partnership:
UNSC (2013), Report of the Secretary general on Big data and modernization of statistical systems UNSC (2014), Big data and modernization of statistical systems, December 2013.
Available at http://unstats. un. org/unsd/statcom/doc14/2014-11-Bigdata-E. pdf Van damme, E. 2002), The European UMTS-auctions, European Economic Review, Volume 46, Issues 4-5 may 2002, pp. 846
Varian, H r. 2013a), Beyond Big data. NABE Annual Meeting. San francisco, CA Varian, H r. 2013b), Big data: New Tricks for Econometrics
Wallsten, S. 2001), An econometric analysis of telecom competition, privatization, and regulation in Africa and Latin america
A. 2013), Undefined By Data: A Survey of Big data Definitions, 2. Databases. Retrieved from http://arxiv. org/abs/1309.5821
Waverman, L. and Koutroumpis, P. 2011), Benchmarking telecoms regulation: The Telecommunications Regulatory Governance Index (TRGI.
Weber, N m.,Palmer, C. L. and Chao, T. C. 2012), Current Trends and Future Directions in Data Curation Research and Education
Rewards and Risks of Big data (B. Bilbao-Osorio, S. Dutta and B Lanvin, Eds References
Using Cellular network Data Data for Development: Net Mobi 2013 Xia, J. 2011), The third-generation-mobile (3g) policy and deployment in China:
Current status, challenges, and prospects Telecommunications Policy, Volume 35, Issue 1 february 2011, pp. 51-63. doi:
objectives, data availability and the results of various statistical analyses such as the principal component analysis (PCA.
Data for all of these indicators are collected by ITU. 2 1. Fixed-telephone subscriptions per 100
data cards or USB modems, subscriptions to public mobile data services, private trunked mobile radio, telepoint, radio paging and
telemetry services 3. International Internet bandwidth (bit/s) per Internet user International Internet bandwidth refers to the
bandwidth, in megabits per second (Mbit/s It is measured as the sum of used capacity of
Data are obtained by countries through national household surveys and are provided either directly to ITU by national statistical offices
There are certain data limits to this indicator insofar as estimates have to be calculated for many developing countries which do not yet
more data become available, the quality of the indicator will improve 5. Percentage of households with Internet
Data are obtained by countries through national household surveys and are provided either directly to ITU by national statistical offices
There are certain data limits to this indicator insofar as estimates have to be calculated for many developing countries which do not yet
more data become available, the quality of the indicator will improve b) ICT use indicators
Data for all of these indicators are collected by ITU. 4 1. Percentage of individuals using the Internet
Data are obtained by countries through national household surveys and are provided either directly to ITU by national statistical offices
There are certain data limits to this indicator insofar as estimates have to be calculated for many developing countries which do not yet
more data become available, the quality of the indicator will improve 2. Fixed (wired)- broadband subscriptions per
mobile-broadband data subscriptions to the public Internet. It covers actual subscribers, not potential subscribers
with advertised data speeds of 256 kbit/s or greater that allow access to the
been used to set up an Internet data connection using Internet Protocol (IP) in the past three months.
Internet data connection, even if the messages are delivered via IP. Dedicated mobile-broadband data subscriptions refers to subscriptions to dedicated data
services (over a mobile network) that allow access to the greater Internet and which are purchased separately from
voice services, either as a standalone service (e g. using a data card such as a USB modem/dongle) or as an add-on data
package to voice services which requires an additional subscription. All dedicated mobile-broadband subscriptions with
recurring subscription fees are included regardless of actual use. Prepaid mobile -broadband plans require use if there is no
Data on adult literacy rates and gross secondary and tertiary enrolment ratios are collected by the
data A critical step in the construction of the index is to create a complete data set, without missing values
There are several imputation techniques that can be applied to estimate missing data. 6 Each of the
imputation techniques, like any other method employed in the process, has its own strengths and
ensure that the imputed data will reflect a countryâ s actual level of ICT access, usage and skills
missing data, where previous year data are not available to calculate the growth rates. Hot -deck imputation uses data from countries with
âoesimilarâ characteristics, such as GNI per capita and geographic location. For example, missing data for country A were estimated for a certain indicator by
first identifying the countries that have similar levels of GNI per capita and that are from the same region
use data of country A was estimated by using Internet use data of country B from the same region
with similar level of GNI per capita and similar level of fixed Internet and wireless-broadband
estimate missing data for all indicators included in the index 3. Normalization of data Normalization of the data is necessary before any
aggregation can be made in order to ensure that the data set uses the same unit of measurement For the indicators selected for the construction
of the IDI, it is important to transform the values to the same unit of measurement, since
some values are expressed as a percentage of the population/of households, whereby the maximum value is 100, while other indicators
of the huge dispersion of values, the data were transformed first to a logarithmic log) scale. Outliers were identified then
After normalizing the data, the individual series were rescaled all to identical ranges, from 1 to
the data were transformed first to a logarithmic (log) scale The ideal value of 787â 260 bit/s per Internet user is equivalent to 5. 90 if transformed to a log scale
weighting and aggregation of the data Each of the processes or combination of processes affects the IDI value.
1 Principal component analysis was used to examine the underlying nature of the data. A more detailed description of the
2 More information about the indicators is available in the ITU â Handbook for the collection of administrative data on
EGH. pdf). As some of the data used in the calculation of the IDI were collected before that meeting,
however, the data may not necessarily reflect these revisions 4 More information about the indicators is available in the ITU âoehandbook for the collection of administrative data on
telecommunications/ICTÂ â 2011, see ITU 2011b and the ITU âoemanual for Measuring ICT Access and Use by Households and
Annex 2. ICT price data methodology 1. Price data collection and sources The price data presented in this report were
collected in the fourth quarter of 2013. The data were collected through the ITU ICT Price Basket
questionnaire, which was sent to the administrations and statistical contacts of all 193 ITU Member States in October 2013.
Through the questionnaire contacts were requested to provide 2013 data for fixed-telephone, mobile-cellular, fixed-broadband
and mobile-broadband prices; the 2011 and 2012 prices were included for reference, where available. For those countries that did not reply
The collection of price data from ITU Member States and the methodology applied for the IPB was agreed upon by the ITU Expert
Annex 2. ICT price data methodology 232 The fixed-telephone sub-basket does not take
for a SIM CARD. The basket gives the price of a standard basket of mobile monthly usage in USD
6. The same price plan should be used for collecting all the data specified. For example, if a given Plan A is selected for the fixed
for data collection, since it may not be possible to isolate the prices for one service. It is preferable to use prices for a specific
Annex 2. ICT price data methodology 234 Annex Table 2. 1: OECD mobile-cellular low-user call distribution (2009 methodology
data usage of (a minimum of) 1 Gigabyte GB). ) For plans that limit the monthly amount
of data transferred by including data volume caps below 1 GB, the cost for the additional
bytes is added to the sub-basket. The minimum speed of a broadband connection is 256 kbit/s
Annex 2. ICT price data methodology 236 Annex Box 2. 3: Rules applied in collecting fixed-broadband Internet prices
7. The same price plan should be used for collecting all the data specified. For example, if a given Plan A is selected for the fixed
the volume of data that can be downloaded, etc 8. Prices should be collected for regular (non-promotional) plan
for price data collection, since it may not be possible to isolate the prices for one service.
GB of data volume. If providers set a limit of less than 1 GB on the amount of data that can be
transferred within a month, then the price per additional byte is added to the monthly price
so as to calculate the cost of 1 GB of data per month. Preference should be given to the most
widely used fixed (wired)- broadband technology DSL, cable, etc..The sub-basket does not include installation charges, modem prices or
-broadband price data from ITU Member States was agreed upon by the ITU Expert Group on
lessons learned from the first data collection exercise. The revised methodology was endorsed by the eleventh World Telecommunication/ICT
To capture the price of different data packages covering prepaid and postpaid services, and supported by different devices (handset and
collected for two different data thresholds, based on a set of rules (see Annex Box 2. 4
of data allowance (or validity. The customer i) continues to use the service and pays an
offers that include the minimum amount of data for each respective mobile-broadband plan The guiding idea is to base each plan on what
data allowance and validity of each respective plan Annex 2. ICT price data methodology 238
Annex Box 2. 4: Rules applied in collecting mobile-broadband prices11 1. Prices should be collected based on one of the following technologies:
plans, the plan satisfying the indicated data volume requirement should be used 8. Where operators propose different commitment periods for postpaid mobile-broadband plans,
9. Price data should be collected for the cheapest plan with a data volume allowance of a minimum of
Data volumes should refer to both upload and download data volumes. If prices are linked to â hours of useâ and not to data volumes, this information should be added in a separate note
Note: ITU will most likely not be able to include these cases in a comparison
10. A validity period of 30 days should be chosen. If this is not available, 15 days should be used.
Preference should be given to packages (including a certain data volume. Pay-as-you-go offers should be used
since most often there are limits in the data volumes, either applied by throttling (limiting the speed)
of ITU supply-side indicators (i e. data collected from operators), as well as to discuss outstanding methodological issues and
EGTI is open to all ITU members and experts in the field of ICT statistics and data collection.
8 Data for fixed-telephone, mobile-cellular and fixed-broadband have been collected since 2008 through the ITU ICT Price
10 Some operators throttle speeds after the data allowance included in the base package has been reached.
Data in italics refer to ITU estimates. For further notes, see p. 248 Source: ITU World Telecommunication/ICT Indicators database
Annex 3. Statistical tables of indicators used to compute de IDI 244 Percentage of individuals using the Internet
Data in italics refer to ITU estimates. For further notes, see p. 248 Source: ITU World Telecommunication/ICT Indicators database
Annex 3. Statistical tables of indicators used to compute de IDI 246 Skills indicators Gross enrolment ratio Adult
Data in italics refer to ITU estimates Source: ITU World Telecommunication/ICT Indicators database Annex 3. Statistical tables of indicators used to compute de IDI
248 Access indicators Fixed-telephone subscriptions per 100 inhabitants, 2012 1) Incl. 524 958 WLL subscriptions. 2) Incl. payphone, excl.
change in tariff policy of the biggest WLL operator. 13) Data excluding own (NRA) consumption. 14) Excl. voice-over-IP (Voip
on 2013q3 data. 19) Refers to active Fixed Wired/Wireless lines. 20) Per June 2013.21) Operatorsâ data. 22) Residential:
Definitive data (annual report) may change because quarterly reports use a smaller sample of operators than annual report. 24) Fixed and fixed-wireless subscriptions. 25) Excl. internal lines and WLR of
Data for the third quarter of 2013 Mobile-cellular subscriptions per 100 inhabitants, 2012 1) Numbers are down due to data cleanse. 2) ACMA Communications Report 2011-12.2) incl. payphone, excl.
Voip. 3) Active subscriptions. 4) Bhutan Telecom and Tashi Cell are the only two service providers in Bhutan. 5) Activity criteria:
voice or data communication in the last month. 6 december 2012.7) Total number of subscriptions (including non-active:
Excl. 2 720 698 prepaid cards that are used to provide Travel SIM/World Mobile service. 9) Excl. data-only SIM CARDS and M2m
Including PHS and data cards undividable. 15) Decrease was due to registration of SIMS. 16) Figure obtained from all five mobile (GSM & CDMA) operators
currently providing service in the country. 17) Incl. inactive subscriptions. 18) Preliminary. 19) Active subscriptions (87.85
Incl. inactive. 22) Incl. data-only subscriptions not possible to disaggregate the information at this point.
24) Incl. active (in the last 6 months) prepaid accounts. 25) Registered SIM CARDS (incl. inactive:
active and non-active subscriptions. 28) Incl. data dedicated subscriptions. 29) Decrease due to the closing of MTS
but excl. mobile data subscriptions (via data cards, USB modems and M2m cards). 7) Estimate
8) Incl. data-only subscriptions. 9) Data based on NRA estimates. Excl. data-only SIM CARDS and M2m cards. 10) Incl. fixed
wireless local loop (WLL) subscriptions. 11) Incl. PHS and data cards, undividable. 12) Active subscriptions (after clearing
simcards that became inactive. 13) Figure obtained from all four mobile (GSM & CDMA) operators currently providing
16 february 2013 NTA MIS. 17) Based on 2013q3 data. 18) Measured using subscriptions active in the last 90 days. 19) Per
June 2013.20) Excl. 463 646 M2m subscriptions. 21) Incl. active (in the last 6 months) prepaid accounts. 22) Registered SIM
Definitive data (annual report) may change because quarterly reports use a smaller sample of operators than annual report. 26) Data for the fourth quarter of 2013.27) Incl. data dedicated subscriptions
28) Reduction is due to implementation of sim cards registration International Internet bandwidth Bit/s per Internet user, 2012
1) Refers to a survey conducted with the following companies: Global crossing, TIWS, Embratel e Globenet. 2) This is from
MOT. 11) Data obtained from nine service operators. 12 may 2012 purchased capacity. Lit capacity: 43 096 471 Mbit/s. 13) Incoming capacity;
weekly incoming capacity, averaged over 4 weeks in December 14) Preliminary. 15) SLT Data. 16) Refers to the total capacity
4 may be revised with comprehensive data from mobile broadbnad providers. 5) Estimate. 6 june. 7) Sum of incoming capacity of all ISPS in the country. 8) Activated external
capacity. 9) By September 2013.10) Data obtained from eight service operators. 11) 1st april 2013 purchased capacity.
or laptop. 4) Data correspond to dwellings (not households). 5) Ghana Living Standards Survey 2012/2013
-Economic survey-2012.9) Census data. 10) Computer includes the number of personal computer, Notebook, and PDA
1) Labour force Survey 2013.2) Cambodia Inter-censal Population Survey. 3) Refers to PC, laptop or a tablet. 4) Data
%2) Preliminary. 3) Data correspond to dwellings (not households. 4) Ghana Living Standards Survey 2012/2013.
via mobile modem. 10) Census data. 11) Excl. households which didnâ t know type of internet access 172 346 households
1) Labour force Survey 2013.2) Corresponds to all type of internet connections. 3) Data correspond to dwellings (not
instead of data survey. 13) Ghana Living Standards Survey 2012/2013. The estimate is based on weighting households who use
based on the data provided by 89.1%of operators 6) The figure is corrected. The previous figure was 1â 636â 700.7) Only ADSL, excl. cable modem. 8) Speeds greater than, or
than 144 kbit/s. 14) Operators data/ictqatar estimate. 15) Incl. subscriptions at downstream speeds equal to, or greater than
subscriptions. 18) Excl. corporate connections. 19) Data reflect subscriptions with associated transfer rates exceeding 200
and Fiber links. 5) Estimate, no specific data collected for â ¥ 256 kbit/s. 6) CRC estimation as of 31.12.2013.7) Estimate. 8) Data
Broadband in India with effect from 18th july 2013.12) December 13) ADSL and Leased lines. 14) Based on 2013q3 data
15) Per June 2013.16) Operatorsâ data. 17) Incl. subscriptions at downstream speeds equal to, or greater than, 144 kbit/s (the
Definitive data (annual report may change because quarterly reports use a smaller sample of operators than annual report. 19) Estimate.
2013.20) Excl. 3175 Wimax subscriptions. 21) Excl. corporate connections. 22) 2013 data is an estimate as of June 30, 2013
Data reflect subscriptions with associated transfer rates exceeding 200 kbps in at least one direction, consistent with the
12) Satellite, BWA and active mobile subscriptions. 13) Estimate based on partial SIT data and ITU estimates. 14) Speeds
/Data refer to the sum of fixed wireless broadband and active mobile -broadband subscriptions. 16) Incl. mobile broadband and Wimax. 17) Estimate. 18) ETL and LTC. 19) Incl. narrowband
but data are not available. 23) Operators data/ictqatar estimate. 24) Refers to active mobile-broadband
effect from 18th july 2013.16) Data refer to the sum of fixed wireless broadband and active mobile-broadband subscriptions
17) 2013 data is an estimate as of June 30, 2013.18) Incl. mobile broadband and Wimax. 19) Estimate based on 1. Standard
mobile subscriptions using data services 2. Dedicated data subscriptions 3. Add on data packages. 20) Based on 2013q3
data. 21) Per June 2013.22) Mobile broadband only. Fixed wireless and satellite exist but data are not available. 23) Operatorsâ
data. 24) As at Dec 2013.25) Q4 report. Definitive data (annual report) may change because quarterly reports use a smaller
sample of operators than annual report. 26) Wireless Broadband services are not being offered in St vincent as yet.
We anticipate that Mobile broadband and terrestrial fixed broadband services would be in place by the end of 2014.27) OFCOM
Chapter 5. The role of big data for ICT monitoring and for development 5. 1 Introduction
5. 2 Big data sources, trends and analytics 5. 3 Telecommunication data and their potential for big data analytics
5. 4 Big data from mobile telecommunications for development and for better monitoring 5. 5 Challenges and the way forward
List of references Annex 1. ICT Development Index (IDI) methodology Annex 2. ICT price data methodology
Annex 3. Statistical tables of indicators used to compute the IDI
Overtext Web Module V3.0 Alpha
Copyright Semantic-Knowledge, 1994-2011