• Detecting a file's character encoding in Java


    1. Add the dependency

      
            <dependency>
                <groupId>com.github.jiangxincode</groupId>
                <artifactId>cpdetector</artifactId>
                <version>1.0.10</version>
            </dependency>
    

    2. Utility class

    import info.monitorenter.cpdetector.io.*;
    
    import java.io.*;
    import java.nio.charset.Charset;
    import java.util.ArrayList;
    import java.util.List;
    
    public class ReadUtil {
        // Reads the file at filePath line by line, decoding it with the detected charset.
        // Returns null if the file does not exist or cannot be read.
        public static List<String> readFile(String filePath) {
            List<String> res = new ArrayList<>();
            File file = new File(filePath);
            try {
                // getCharsetName throws IOException, so it must be called inside the try block
                String charset = getCharsetName(file);
                // try-with-resources closes the reader even if reading fails
                try (BufferedReader reader = new BufferedReader(
                        new InputStreamReader(new FileInputStream(file), charset))) {
                    String line;
                    while ((line = reader.readLine()) != null) {
                        res.add(line);
                    }
                }
            } catch (FileNotFoundException e) {
                System.out.println("File not found");
                return null;
            } catch (IOException e) {
                System.out.println("Error reading file");
                return null;
            }
            return res;
        }
        /**
         * Detects the character encoding of a file.
         * Returns "UTF-8" if the detector yields null.
         */
        public static String getCharsetName(File file) throws IOException {
            String charsetName = "UTF-8";
            CodepageDetectorProxy detector = CodepageDetectorProxy.getInstance();
            detector.add(new ParsingDetector(false));
            detector.add(JChardetFacade.getInstance());
            detector.add(ASCIIDetector.getInstance());
            detector.add(UnicodeDetector.getInstance());
            Charset charset = detector.detectCodepage(file.toURI().toURL());
            if (charset != null) charsetName = charset.name();
            return charsetName;
        }
    
    
        public static void main(String[] args) throws IOException {
            File file = new File("D:\\testDirs\\data\\test.dat");
            System.out.println(getCharsetName(file));
    
        }
    }
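
If pulling in cpdetector is not an option, a rough JDK-only check can cover the most common cases. The sketch below is not part of the utility class above and does not use cpdetector; the class name `CharsetGuess` and its methods are made up for illustration. It only recognizes UTF-8 and UTF-16 byte order marks, then tests whether the bytes are strictly valid UTF-8, and otherwise returns a fallback charset chosen by the caller.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper: a JDK-only heuristic, far less thorough than cpdetector.
final class CharsetGuess {

    /** Returns the charset implied by a leading byte order mark, or null if none is found. */
    public static Charset fromBom(byte[] b) {
        if (b.length >= 3 && (b[0] & 0xFF) == 0xEF && (b[1] & 0xFF) == 0xBB && (b[2] & 0xFF) == 0xBF) {
            return StandardCharsets.UTF_8;
        }
        if (b.length >= 2 && (b[0] & 0xFF) == 0xFE && (b[1] & 0xFF) == 0xFF) {
            return StandardCharsets.UTF_16BE;
        }
        // Simplification: FF FE is treated as UTF-16LE; UTF-32 BOMs are not handled.
        if (b.length >= 2 && (b[0] & 0xFF) == 0xFF && (b[1] & 0xFF) == 0xFE) {
            return StandardCharsets.UTF_16LE;
        }
        return null;
    }

    /** True if the bytes decode under cs without malformed or unmappable input. */
    public static boolean decodesCleanly(byte[] b, Charset cs) {
        try {
            cs.newDecoder()
              .onMalformedInput(CodingErrorAction.REPORT)
              .onUnmappableCharacter(CodingErrorAction.REPORT)
              .decode(ByteBuffer.wrap(b));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    /** Checks the BOM first, then strict UTF-8, then gives up and returns the fallback. */
    public static Charset guess(byte[] b, Charset fallback) {
        Charset bom = fromBom(b);
        if (bom != null) {
            return bom;
        }
        return decodesCleanly(b, StandardCharsets.UTF_8) ? StandardCharsets.UTF_8 : fallback;
    }

    public static void main(String[] args) {
        // Two CJK characters encoded as UTF-8, without a BOM
        byte[] utf8 = "\u7f16\u7801".getBytes(StandardCharsets.UTF_8);
        System.out.println(guess(utf8, StandardCharsets.ISO_8859_1));
    }
}
```

Pure ASCII content also passes the strict UTF-8 check, so it is reported as UTF-8. Legacy encodings such as GBK cannot be told apart this way, which is where a statistical detector like the one used above is still needed.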
    
    
  • Original article: https://blog.csdn.net/lz970704/article/details/125895993