1 | <?xml version="1.0"?> |
---|
2 | <!-- |
---|
3 | Copyright 2002-2004 The Apache Software Foundation |
---|
4 | |
---|
5 | Licensed under the Apache License, Version 2.0 (the "License"); |
---|
6 | you may not use this file except in compliance with the License. |
---|
7 | You may obtain a copy of the License at |
---|
8 | |
---|
9 | http://www.apache.org/licenses/LICENSE-2.0 |
---|
10 | |
---|
11 | Unless required by applicable law or agreed to in writing, software |
---|
12 | distributed under the License is distributed on an "AS IS" BASIS, |
---|
13 | WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
---|
14 | See the License for the specific language governing permissions and |
---|
15 | limitations under the License. |
---|
16 | --> |
---|
17 | <!DOCTYPE document PUBLIC "-//APACHE//DTD Documentation V2.0//EN" "http://forrest.apache.org/dtd/document-v20.dtd"> |
---|
18 | <document> |
---|
19 | <header> |
---|
20 | <title>Hadoop Archives</title> |
---|
21 | </header> |
---|
22 | <body> |
---|
23 | <section> |
---|
24 | <title> ä»ä¹æ¯Hadoop archives? </title> |
---|
25 | <p> |
---|
26 | Hadoop archivesæ¯ç¹æ®çæ¡£æ¡æ ŒåŒãäžäžªHadoop archive对åºäžäžªæ件系ç»ç®åœã |
---|
27 | Hadoop archiveçæ©å±åæ¯*.harãHadoop archiveå
å«å
æ°æ®ïŒåœ¢åŒæ¯_indexå_masterindxïŒåæ°æ®ïŒpart-*ïŒæ件ã_indexæ件å
å«äºæ¡£æ¡äžçæ件çæ件ååäœçœ®ä¿¡æ¯ã |
---|
28 | </p> |
---|
29 | </section> |
---|
30 | <section> |
---|
31 | <title> åŠäœå建archive? </title> |
---|
32 | <p> |
---|
33 | <code>çšæ³: hadoop archive -archiveName name <src>* <dest></code> |
---|
34 | </p> |
---|
35 | <p> |
---|
36 | ç±-archiveNameé项æå®äœ èŠå建çarchiveçååãæ¯åŠfoo.harãarchiveçååçæ©å±ååºè¯¥æ¯*.harãèŸå
¥æ¯æ件系ç»çè·¯åŸåïŒè·¯åŸåçæ ŒåŒåå¹³æ¶ç衚蟟æ¹åŒäžæ ·ãå建çarchiveäŒä¿åå°ç®æ ç®åœäžã泚æå建archivesæ¯äžäžªMap/Reduce jobãäœ åºè¯¥åšmap reduceé矀äžè¿è¡è¿äžªåœä»€ãäžé¢æ¯äžäžªäŸåïŒ |
---|
37 | </p> |
---|
38 | <p> |
---|
39 | <code>hadoop archive -archiveName foo.har /user/hadoop/dir1 /user/hadoop/dir2 /user/zoo/</code> |
---|
40 | </p><p> |
---|
41 | åšäžé¢çäŸåäžïŒ |
---|
42 | /user/hadoop/dir1 å /user/hadoop/dir2 äŒè¢«åœæ¡£å°è¿äžªæ件系ç»ç®åœäž |
---|
43 | -- /user/zoo/foo.harãåœå建archiveæ¶ïŒæºæ件äžäŒè¢«æŽæ¹æå é€ã |
---|
44 | </p> |
---|
45 | </section> |
---|
46 | <section> |
---|
47 | <title> åŠäœæ¥çarchivesäžçæ件? </title> |
---|
48 | <p> |
---|
49 | archiveäœäžºæ件系ç»å±æŽé²ç»å€çãæ以ææçfs shellåœä»€éœèœåšarchiveäžè¿è¡ïŒäœæ¯èŠäœ¿çšäžåçURIã |
---|
50 | åŠå€ïŒarchiveæ¯äžå¯æ¹åçãæ以éåœåïŒå é€åå建éœäŒè¿åé误ãHadoop Archives çURIæ¯ |
---|
51 | </p><p><code>har://scheme-hostname:port/archivepath/fileinarchive</code></p><p> |
---|
52 | åŠæ没æäŸscheme-hostnameïŒå®äŒäœ¿çšé»è®€çæ件系ç»ãè¿ç§æ
åµäžURIæ¯è¿ç§åœ¢åŒ |
---|
53 | </p><p><code> |
---|
54 | har:///archivepath/fileinarchive</code></p> |
---|
55 | <p> |
---|
56 | è¿æ¯äžäžªarchiveçäŸåãarchiveçèŸå
¥æ¯/dirãè¿äžªdirç®åœå
å«æ件fileaïŒfilebã |
---|
57 | æ/diråœæ¡£å°/user/hadoop/foo.barçåœä»€æ¯ |
---|
58 | </p> |
---|
59 | <p><code>hadoop archive -archiveName foo.har /dir /user/hadoop</code> |
---|
60 | </p><p> |
---|
61 | è·åŸå建çarchiveäžçæ件åè¡šïŒäœ¿çšåœä»€ |
---|
62 | </p> |
---|
63 | <p><code>hadoop dfs -lsr har:///user/hadoop/foo.har</code></p> |
---|
64 | <p>æ¥çarchiveäžçfileaæ件çåœä»€- |
---|
65 | </p><p><code>hadoop dfs -cat har:///user/hadoop/foo.har/dir/filea</code></p> |
---|
66 | </section> |
---|
67 | </body> |
---|
68 | </document> |
---|