UNIT 4 - BIG DATA AND PRIVACY LESSON 1: WHAT IS BIG DATA VOCABULARY ALERT! Big Data - a broad term for datasets so large or complex that traditional data processing applications are inadequate. Moore's Law - a predication made by
Gordon Moore in 1965 that computing power will double every 1.5-2 years, it has remained more or less true ever since. PURPOSE Big data is a big deal right now, both in the field of computer science and more broadly across
fields and industries. Where does Big Data come from, what makes it big, and how people use big data to solve problems? How much of your lives are datafied or could be? Who
knows what about you? WATCH VIDEO (15 MINUTES): Video: Big data is better data - TED talk Video What is Big Data? Big data means different things, at different times, to different people.
It can mean devices that are constantly collecting data. It can mean digitizing data thats been around for a long time (e.g., every book ever written). It can mean machine learning and artificial intelligence. You have this in Code Studio: The IDCs Digital Universe is described
as a measure of all the digital data created, replicated, and consumed in a single year. source: http://www.emc.com/co llateral/analyst-reports/ idc-the-digital-universein-2020.pdf
Part of what contributes to data being "big" is the sheer growth of the amount of data in the world. Lets have a look at a graph that shows us just how large big data is. As you can see from the chart, the amount of data flying around is growing exponentially, doubling every two years or so. Heres a way to think about how fast this is: The world will produce as much digital data over the next 2 years, as currently
existed in all of humanity prior to that. And it will do the same the 2 years after that. And so on. Thats a lot! Moores Law Moore's law is actually about computing power, not data, but data growth seems to following the same trend So far, computing power/capacity seems to double
every 1.5-2 years... That means it grows exponentially... Exponential growth is hard for humans to fathom... Yet we need to plan for it. Distribute: Activity Guide - Big Data Sleuth C ard - Activity Guide .
Big data surrounds us but it is sometimes surprisingly challenging to get access to it, use it, or see it. Much of the data out there is in the wild. Even when the data is available, it can sometimes be challenging to figure out where it came from, or
how to use it. Get into pairs and assign each pair one of the 5 websites listed. 1 Web archive http://www.archive.org 2 Measure of America http://www.measureofamerica.org/maps/ 3 Wind Sensor network http://earth.nullschool.net/ 4 Twitter sentiment https://www.csc.ncsu.edu/faculty/
healey/tweet_viz/tweet_app/ 5 Alternative Fuel Locator http://www.afdc.energy.gov/locator/stations/ Student tasks are to follow the resource and answer prompts related to: the visualization tool provided the original source of the data
evaluating the usefulness of both the data and the visualization. Here is a general-purpose definition of Big Data (taken from Wikipedia: Big Data): Big data is a broad term for datasets so large or complex that traditional data processing applications are inadequate." The fact that big data is increasingly important across industries reflects rapid changes in how much data we're collecting, and the
ways we're using it. In this unit we're going to be looking into how growth in data and computing more generally is impacting society. In almost every industry and every aspect our lives, computing and data is affecting our lives in both positive and negative ways. This will also be very useful preparation as we begin to look towards the Explore PT. Look at: printed copies of
College Board - Assessment Overv iew and Performance Task Directio ns for Students . We will review pages 4-6 which introduces the Explore PT Components Page 4: The Explore PT has 2 major components: 1. computational artifact,
2. written responses Pages 5-6: Skim the submission requirements and give students time to read prompts 2a - 2e. Highlight prompts 2c and 2d which references beneficial / harmful effects and the way computing innovations use data, themes of this unit. No homework today!
Turn in your Activity Guide for credit. GLAD YOU CAME BACK FOR TRI 2!!
Prep three questions or comments to hand in on Jan 17th. Use Moodle to turn in assignment. Chapter 3 - Differential gene expression. Figures assignments to do on Monday the 22nd? 3.25 - Matthew A. 3.26 - Rachel B. 3.27...
Basic Strategies For Meeting Uneven Demand. Level capacity strategy: Maintaining a steady rate of regular-time output while meeting variations in demand by a combination of options. Chase demand strategy: Matching capacity to demand; the planned output for a period is...
Introduction: The conventional monocentric model of urban employment was suggested to be largely irrelevant. Employment is decentralizing to the suburbs and centralizing within the suburbs. Currently two approaches: I. City's production and residential sectors are either segregated or integrated. II.
OVEC ELA Content Specialist www.DeniseAmos.com [email protected] 502-552-5735 Kansas Department of Education Text Complexity Resources ... High school textbooks have declined in all subject areas over the last several decades. Average length of sentences in K-8 textbooks has declined ...
Sona Systems . An online reservation system so you can keep track of your credits and your professors can also keep track of your credits. Professors do have access to Sona to keep track of the progress students are making...
Ready to download the document? Go ahead and hit continue!