
Big Data
capture
storage
sharing transfer
• WHERE
• WHAT
• WHY
• HOW
asv[s]://<container>@<account>.blob.core.windows.net/<path>
<property>
<name>fs.azure.account.key.accountname</name>
<value>enterthekeyvaluehere</value>
</property>
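The URI pattern and the account-key property above can be assembled programmatically. The following is a minimal sketch, not an official API: the helper names asv_uri and account_key_property are hypothetical, the reading of "asv[s]" as a choice between a plain asv scheme and a secure asvs scheme is an assumption, and "accountname" in the property name is assumed to be a placeholder for the storage account name.

```python
from xml.sax.saxutils import escape


def asv_uri(container: str, account: str, path: str, secure: bool = False) -> str:
    """Build a URI following asv[s]://<container>@<account>.blob.core.windows.net/<path>."""
    # Assumption: "asvs" is the secure variant of the "asv" scheme.
    scheme = "asvs" if secure else "asv"
    return f"{scheme}://{container}@{account}.blob.core.windows.net/{path.lstrip('/')}"


def account_key_property(account: str, key: str) -> str:
    """Render the <property> element that carries the storage account key."""
    # Assumption: "accountname" in the slide stands for the real account name.
    return (
        "<property>\n"
        f"  <name>fs.azure.account.key.{escape(account)}</name>\n"
        f"  <value>{escape(key)}</value>\n"
        "</property>"
    )


if __name__ == "__main__":
    # "myaccount" is an illustrative account name, not one from the slides.
    print(asv_uri("mycontainer", "myaccount", "a.txt"))
    print(account_key_property("myaccount", "enterthekeyvaluehere"))
```

Escaping the account name and key keeps the generated element well-formed even if a value contains characters such as & or <.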
Container      Blob Name
mycontainer    a.txt
mycontainer    b.txt
mycontainer    dir1\c.txt
mycontainer    dir1\dir2\d.txt
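The container and blob name listing above suggests that the namespace is flat: a "directory" exists only as a prefix inside the blob name. As a rough illustration of how a directory view could be derived from such names, here is a small sketch. The function list_directory is hypothetical, and the backslash delimiter is taken from the listing as written; the delimiter is a parameter because other tools may use a different separator.

```python
def list_directory(blob_names, prefix="", delimiter="\\"):
    """Return (files, subdirectories) found directly under `prefix`.

    Blob names are flat strings; a "directory" is just the part of the
    name before a delimiter.
    """
    files, subdirs = [], set()
    for name in blob_names:
        if not name.startswith(prefix):
            continue
        rest = name[len(prefix):]
        if not rest:
            continue  # the name equals the prefix itself; nothing to list
        head, sep, _ = rest.partition(delimiter)
        if sep:
            subdirs.add(head)   # first path segment is a virtual directory
        else:
            files.append(rest)  # no delimiter left: a blob at this level
    return sorted(files), sorted(subdirs)


if __name__ == "__main__":
    blobs = ["a.txt", "b.txt", "dir1\\c.txt", "dir1\\dir2\\d.txt"]
    print(list_directory(blobs))            # top level of the container
    print(list_directory(blobs, "dir1\\"))  # contents of the virtual dir1
```

Nothing is created or deleted when a directory "appears" or "disappears" in this view; it is only a consequence of which blob names currently share a prefix.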
Target Destination              Tool / Library                                                               Requires Active HDInsight Cluster
SQL Server, Azure SQL DB        Sqoop (Hadoop ecosystem project)                                             Yes
Excel                           Codename “Data Explorer”                                                     No
Another Blob Storage Account    Azure Blob Storage REST APIs (Copy Blob, etc.)                               No
SQL Server Analysis Services    Hive ODBC Driver                                                             Yes
Existing BI Apps                Hive ODBC Driver (assumes app supports ODBC connections to data sources)     Yes
Hive, Pig, Mahout, Cascading, Scalding, Scoobi, Pegasus…
C#, F# Map/Reduce, LINQ to Hive, .NET management clients
JavaScript Map/Reduce, Browser hosted console, Node.js management clients
PowerShell, Cross Platform CLI tools, SSIS Custom tasks
Sources
• http://hadoopsdk.codeplex.com
• http://www.github.com/windowsazure

NuGet packages
• Microsoft.Hadoop.MapReduce
• Microsoft.Hadoop.Hive
• Microsoft.Hadoop.WebHDFS => WebClient

NPM packages
• azure
• azure-cli
