Hadoop 2.x New Feature: The Recycle Bin

When the Recycle Bin is enabled, a deleted file is moved to the Recycle Bin instead of being removed immediately, and it can be restored as long as its retention time has not expired. This protects against accidental deletion.

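1. Recycle Bin parameter settings and working mechanism

fs.trash.interval is the number of minutes a deleted file is kept in the Recycle Bin; the default value 0 disables the Recycle Bin. fs.trash.checkpoint.interval is the number of minutes between trash checkpoints; it should be less than or equal to fs.trash.interval, and its default value 0 means it takes the value of fs.trash.interval.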

2. Enable Recycle Bin

Edit core-site.xml and set fs.trash.interval, the time a deleted file is kept in the Recycle Bin, to 1 minute.

<property>
  <name>fs.trash.interval</name>
  <value>1</value>
</property>
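
To make the effect of the interval concrete, here is a minimal sketch of the retention arithmetic. It is a simplified model, not Hadoop's own code: the class and method names are hypothetical, and it only assumes that the interval is expressed in minutes and that 0 means the Recycle Bin is disabled.

```java
import java.util.concurrent.TimeUnit;

// Hypothetical helper that models the retention rule; it does not call Hadoop.
final class TrashRetentionSketch {

    // An interval of 0 minutes means the Recycle Bin is disabled.
    static boolean trashEnabled(long intervalMinutes) {
        return intervalMinutes > 0;
    }

    // A trash checkpoint is treated as expired once it is older than the interval.
    static boolean isExpired(long checkpointMillis, long nowMillis, long intervalMinutes) {
        long ageMillis = nowMillis - checkpointMillis;
        return ageMillis > TimeUnit.MINUTES.toMillis(intervalMinutes);
    }

    public static void main(String[] args) {
        long checkpoint = 0L;
        // With an interval of 1 minute, a 30-second-old checkpoint is still kept.
        System.out.println(isExpired(checkpoint, 30_000L, 1));
        // A 90-second-old checkpoint is past the 1-minute interval.
        System.out.println(isExpired(checkpoint, 90_000L, 1));
    }
}
```

With the 1-minute setting above, a deleted file is only recoverable for a very short time, so this value is suited to testing rather than production use.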

3. View Recycle Bin

The path of the recycle bin in the cluster: /user/username/.Trash/….
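
As a rough illustration of where a deleted file ends up, the sketch below builds the trash location with plain string handling. It does not call Hadoop. The class and method names are hypothetical, and the layout /user/<username>/.Trash/Current/<original path> is assumed from the restore command used later in this article.

```java
// Hypothetical helper that shows how a deleted path maps into the user's trash.
final class TrashPathSketch {

    // Returns the assumed location of originalPath under the user's Current trash directory.
    static String trashLocation(String username, String originalPath) {
        if (!originalPath.startsWith("/")) {
            throw new IllegalArgumentException("expected an absolute path: " + originalPath);
        }
        return "/user/" + username + "/.Trash/Current" + originalPath;
    }

    public static void main(String[] args) {
        // Prints /user/atguigu/.Trash/Current/user/atguigu/input
        System.out.println(trashLocation("atguigu", "/user/atguigu/input"));
    }
}
```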

4. Modify the user name for accessing the Recycle Bin

By default, the user name used to browse the Recycle Bin in the web UI is dr.who. Change it to your own user name:

[core-site.xml]

<property>
 <name>hadoop.http.staticuser.user</name>
 <value>Username</value>
</property>

5. Files deleted by a program do not go through the Recycle Bin

Files deleted through the FileSystem API are removed immediately. To send a file to the Recycle Bin from code, call moveToTrash():

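import org.apache.hadoop.fs.Trash;
// conf is an org.apache.hadoop.conf.Configuration; path is an org.apache.hadoop.fs.Path
Trash trash = new Trash(conf);
trash.moveToTrash(path); // returns true if the file was moved to the Recycle Bin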

6. Restore Recycle Bin Data

hadoop fs -mv /user/atguigu/.Trash/Current/user/atguigu/input /user/atguigu/input
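
The destination in the command above is simply the part of the source path that follows the Current directory. The sketch below derives it with plain string handling. The class and method names are hypothetical, nothing here calls Hadoop, and it assumes the first directory under .Trash (Current or a checkpoint directory) is followed by the original absolute path.

```java
// Hypothetical helper that recovers the original path from a path inside the trash.
final class TrashRestoreSketch {

    // Drops everything up to and including the first directory under ".Trash".
    static String originalPath(String trashPath) {
        String marker = "/.Trash/";
        int markerAt = trashPath.indexOf(marker);
        if (markerAt < 0) {
            throw new IllegalArgumentException("not a trash path: " + trashPath);
        }
        // Skip the directory directly under .Trash, for example "Current".
        int nextSlash = trashPath.indexOf('/', markerAt + marker.length());
        if (nextSlash < 0) {
            throw new IllegalArgumentException("no original path inside: " + trashPath);
        }
        return trashPath.substring(nextSlash);
    }

    public static void main(String[] args) {
        String source = "/user/atguigu/.Trash/Current/user/atguigu/input";
        // Prints the restore command with its derived destination.
        System.out.println("hadoop fs -mv " + source + " " + originalPath(source));
    }
}
```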

7. Empty the Recycle Bin

hadoop fs -expunge
