Data Lake for Enterprises
Sqoop working example

We will use Google Cloud Platform to run the whole use case covered in this book. The screenshots and code throughout the book are presented with this in mind, so that by the end of the book the reader has a fully functioning Data Lake in the cloud, which can then gradually be connected to the real database existing in the enterprise.

Because this is the first chapter that deals with installation and code, it installs a number of software packages, tools, technologies, and libraries that subsequent chapters refer to. Some of these installations and commands are not strictly required for Sqoop itself, but they are needed to run everything in the cloud on a clean node with nothing installed on it.

These examples have been prepared and tested on CentOS 7, which is the platform for all the examples covered in this book.