Read Excel by column name with Java POI
Good afternoon, experts. I have a problem: I need to read an entire Excel file by column name instead of by column index. Example:
Column1 | Column2 | Column3
data1 data 2 data 3
POI lets me read the column index with the method getColumnIndex(), which returns 0 for Column1, 1 for Column2, and so on, but I need to read the cells by column name (Column1, Column2, etc.). Is there any way to do this?
I need to read all fields of every row by column name. Here is the code I use to read my file:
updated code:
import org.apache.poi.ss.usermodel.Cell;
import org.apache.poi.ss.usermodel.DataFormatter;
import org.apache.poi.ss.usermodel.Row;
import org.apache.poi.xssf.usermodel.XSSFSheet;
import org.apache.poi.xssf.usermodel.XSSFWorkbook;
import java.io.File;
import java.io.FileInputStream;
import java.io.FileNotFoundException;
import java.io.IOException;
import java.util.Iterator;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
public class example {
DataFormatter fmt = new DataFormatter();
/**
* @param args the command line arguments
*/
public static void main(String[] args) throws FileNotFoundException, IOException {
example softMarti = new example();
FileInputStream file = new FileInputStream(new File("C:archive.xlsx"));
XSSFWorkbook workbook = new XSSFWorkbook(file);
XSSFSheet sheet = workbook.getSheetAt(0);
Iterator<Row> rowIterator = sheet.iterator();
while (rowIterator.hasNext()) {
Row row = rowIterator.next();
int rowIndex = row.getRowNum();
if (rowIndex < 1) {
continue;
}
Iterator<Cell> cellIterator = row.cellIterator();
while (cellIterator.hasNext()) {
Cell cell = cellIterator.next();
int columnIndex = cell.getColumnIndex();
if (columnIndex != 0 && columnIndex != 1 && columnIndex != 4) {
continue;
}
String columnName = "";
switch (columnIndex) {
case 0:
columnName = "column1";
break;
case 1:
columnName = "column2";
break;
case 4:
columnName = "column 4";
break;
}
// getValue and isValid are instance methods, so call them on the instance, not on the class
String value = softMarti.getValue(cell);
boolean valid = softMarti.isValid(columnIndex, value);
if (valid) {
continue;
}
System.out.print(columnName + rowIndex);
System.out.println(" -> " + value);
}
}
// TODO code application logic here
}
private String getValue(Cell cell) {
switch (cell.getCellType()) {
case Cell.CELL_TYPE_BLANK:
return null;
case Cell.CELL_TYPE_BOOLEAN:
return "CELL_TYPE_BOOLEAN";
case Cell.CELL_TYPE_ERROR:
return "CELL_TYPE_ERROR";
case Cell.CELL_TYPE_FORMULA:
return "CELL_TYPE_FORMULA";
case Cell.CELL_TYPE_NUMERIC:
return fmt.formatCellValue(cell);
case Cell.CELL_TYPE_STRING:
return cell.getStringCellValue();
default:
return "none";
}
}
boolean isValid(int column, String value) {
if (value == null) {
return false;
}
String pattern = "";
switch (column) {
case 0:
pattern = "[A-Za-z0-9_\\- ]{1,20}";
break;
case 1:
pattern = "[A-Za-z0-9_\\- ]{1,80}";
break;
case 4:
pattern = "[0-9]{1,8}";
break;
}
Pattern pat = Pattern.compile(pattern);
Matcher mat = pat.matcher(value);
return mat.matches();
}
}
This code works, but I need to look cells up by column name, because in my project the columns may change position. That is my goal.
Solution 1:[1]
Why not read the cell values of the first row (row 0, cells 0 to n, i.e. the column names) and put each (columnName, columnIndex) pair into a Map<String, Integer>? Then you can look up the column index by name.
Here's an example:
Map<String, Integer> map = new HashMap<String,Integer>(); //Create map
Row row = sheet.getRow(0); //Get first row (Row/Cell work for both .xls and .xlsx sheets)
//following is boilerplate from the java doc
short minColIx = row.getFirstCellNum(); //get the first column index for a row
short maxColIx = row.getLastCellNum(); //get the last column index for a row, plus one
for(short colIx=minColIx; colIx<maxColIx; colIx++) { //loop from first to last index
Cell cell = row.getCell(colIx); //get the cell
if(cell == null) continue; //skip gaps in the header row
map.put(cell.getStringCellValue(),cell.getColumnIndex()); //add the cell contents (name of column) and cell index to the map
}
After this you have a map from column name to column index. Then you can do:
int idx = map.get("ColumnName");
...and use it in row.getCell(idx) to get the cells in all the other rows.
Read the comments in the code below. I can't help you beyond this; you will need to read the documentation and work out the rest.
Workbook workbook = WorkbookFactory.create(new FileInputStream("C:\\file.xlsx"));
Sheet sheet = workbook.getSheetAt(0);
int totalRows = sheet.getPhysicalNumberOfRows(); //number of rows that exist, header included
Map<String, Integer> map = new HashMap<String,Integer>(); //Create map
Row row = sheet.getRow(0); //Get first row
//following is boilerplate from the java doc
short minColIx = row.getFirstCellNum(); //get the first column index for a row
short maxColIx = row.getLastCellNum(); //get the last column index for a row, plus one
for(short colIx=minColIx; colIx<maxColIx; colIx++) { //loop from first to last index
Cell cell = row.getCell(colIx); //get the cell
if(cell == null) continue; //skip gaps in the header row
map.put(cell.getStringCellValue(),cell.getColumnIndex()); //add the cell contents (name of column) and cell index to the map
}
List<ReportRow> listOfDataFromReport = new ArrayList<ReportRow>();
for(int x = 1; x<totalRows; x++){ //row 0 is the header, so data rows are 1..totalRows-1 (assumes no empty rows in between)
ReportRow rr = new ReportRow(); //Data structure to hold the data from the xls file.
Row dataRow = sheet.getRow(x); //get row 1 to row n (rows containing data)
int idxForColumn1 = map.get("Column1"); //get the column index for the column with header name = "Column1"
int idxForColumn2 = map.get("Column2"); //get the column index for the column with header name = "Column2"
int idxForColumn3 = map.get("Column3"); //get the column index for the column with header name = "Column3"
Cell cell1 = dataRow.getCell(idxForColumn1); //Get the cells for each of the indexes
Cell cell2 = dataRow.getCell(idxForColumn2);
Cell cell3 = dataRow.getCell(idxForColumn3);
//NOTE THAT YOU HAVE TO KNOW THE DATA TYPES OF THE DATA YOU'RE EXTRACTING.
//FOR EXAMPLE cell.getStringCellValue THROWS AN IllegalStateException IF THE CELL HOLDS A NUMBER
rr.setColumn1(cell1.getStringCellValue()); //Get the values out of those cells and put them into the report row object
rr.setColumn2(cell2.getStringCellValue());
rr.setColumn3(cell3.getStringCellValue());
listOfDataFromReport.add(rr);
}
//Now you have a list of report rows
for(int j = 0; j< listOfDataFromReport.size();j++){
System.out.println("Column 1 Value: " + listOfDataFromReport.get(j).getColumn1());
//etc...
}
//This class holds the values from the xls file. You may not need it
// I have no idea what you're doing with the data. If you simply wanted to
//print the data to console you wouldn't need it.
public static class ReportRow{
private String column1;
private String column2;
private String column3;
public String getColumn1(){
return this.column1;
}
public void setColumn1(String column1){
this.column1 = column1;
}
public String getColumn2(){
return this.column2;
}
public void setColumn2(String column2){
this.column2 = column2;
}
public String getColumn3(){
return this.column3;
}
public void setColumn3(String column3){
this.column3 = column3;
}
}
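One caveat with map.get("Column1") in the code above: if the sheet has no header with that exact text, the map returns null and unboxing it into an int throws a NullPointerException with no hint about which column was missing. A minimal sketch of a lookup that fails with a readable message instead is shown here. HeaderIndex is a hypothetical helper, not part of POI, and it takes the header texts as plain strings so it stays independent of the spreadsheet library; trimming header text and keeping the first of two duplicate headers are assumptions about what is wanted.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper (not a POI class): maps header text to column index.
final class HeaderIndex {
    private final Map<String, Integer> indexByName = new LinkedHashMap<>();

    // headers: text of each header cell in column order; null or blank marks an empty cell.
    HeaderIndex(List<String> headers) {
        for (int i = 0; i < headers.size(); i++) {
            String name = headers.get(i);
            if (name == null || name.trim().isEmpty()) {
                continue; // skip empty header cells instead of failing on them
            }
            // putIfAbsent keeps the first column when a header name appears twice
            indexByName.putIfAbsent(name.trim(), i);
        }
    }

    // Returns the column index, or throws with a message naming the missing header.
    int indexOf(String name) {
        Integer idx = indexByName.get(name);
        if (idx == null) {
            throw new IllegalArgumentException(
                "No column named '" + name + "'; headers found: " + indexByName.keySet());
        }
        return idx;
    }

    // True if a header with exactly this (trimmed) text exists.
    boolean contains(String name) {
        return indexByName.containsKey(name);
    }
}
```

With POI, the list would be filled from row 0, for example by adding formatter.formatCellValue(cell) for each column index, and the result of indexOf("Column1") would then be passed to dataRow.getCell(...) in place of map.get("Column1").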
Solution 2:[2]
I wrote a method that returns the index of a column given its header name:
public static int columnName(String a) throws EncryptedDocumentException, InvalidFormatException, IOException {
int coefficient = -1; // -1 means no header matched, so a typo does not silently read column 0
String excelFilePath = ConfigurationReader.getProperty("pathToYourFile"); // or specify the path directly
FileInputStream inputStream = new FileInputStream(new File(excelFilePath));
Workbook wb = WorkbookFactory.create(inputStream);
Sheet sh = wb.getSheet("Sheet1");
Row row = sh.getRow(0);
int cellNum = row.getLastCellNum(); // last cell index plus one, so gaps in the header row are covered
for (int i = 0; i < cellNum; i++) {
Cell cell = row.getCell(i); // null where the header row has no cell
if (cell != null && cell.toString().equals(a)) {
coefficient = i;
}
}
wb.close();
inputStream.close();
return coefficient;
}
and then I just call it in my code:
Cell anyCellName = row.getCell(columnName("NameOfColumnInMyExcell"));
The same works for any column name. Now I can move my columns into any order and the code still works.
Solution 3:[3]
Here is my approach; I hope it helps.
First, get the column names and put them into a map:
Map<String, Integer> requiredHeaders = new HashMap<>();
FileInputStream file = new FileInputStream(new File("filename.xlsx"));
Workbook workbook = new XSSFWorkbook(file);
DataFormatter formatter = new DataFormatter();
Sheet sheet = workbook.getSheetAt(0);
for (Cell cell : sheet.getRow(0)) {
requiredHeaders.put(cell.getStringCellValue(), cell.getColumnIndex());
}
Then we can loop over the rows and read the required cells using the column index:
for (int i = 1; i <= sheet.getLastRowNum(); i++) {
Row row = sheet.getRow(i);
System.out.println("serial = " + formatter.formatCellValue(row.getCell(requiredHeaders.get("serial"))));
System.out.println("pin = " + formatter.formatCellValue(row.getCell(requiredHeaders.get("pin"))));
}
The full code looks like this:
private void readFile() throws FileNotFoundException, IOException {
Map<String, Integer> requiredHeaders = new HashMap<>();
FileInputStream file = new FileInputStream(new File("filename.xlsx"));
Workbook workbook = new XSSFWorkbook(file);
DataFormatter formatter = new DataFormatter();
Sheet sheet = workbook.getSheetAt(0);
for (Cell cell : sheet.getRow(0)) {
requiredHeaders.put(cell.getStringCellValue(), cell.getColumnIndex());
}
for (int i = 1; i <= sheet.getLastRowNum(); i++) {
Row row = sheet.getRow(i);
System.out.println("serial = " + formatter.formatCellValue(row.getCell(requiredHeaders.get("serial"))));
System.out.println("pin = " + formatter.formatCellValue(row.getCell(requiredHeaders.get("pin"))));
}
workbook.close();
}
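Once cells are looked up by header name, the question's isValid(int column, String value) can be keyed by name as well, so the validation no longer depends on column position. The following is only a sketch under stated assumptions: ColumnRules is a hypothetical class, the regular expressions and the names "column1", "column2" and "column 4" are copied from the question's switch statements, and treating a column without a rule as valid is a design choice made here, not something the question specifies.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical helper: validation rules keyed by column name instead of column index.
final class ColumnRules {
    private final Map<String, Pattern> ruleByColumn = new HashMap<>();

    ColumnRules() {
        // Patterns and names taken from the question's isValid and columnName switch.
        ruleByColumn.put("column1", Pattern.compile("[A-Za-z0-9_\\- ]{1,20}"));
        ruleByColumn.put("column2", Pattern.compile("[A-Za-z0-9_\\- ]{1,80}"));
        ruleByColumn.put("column 4", Pattern.compile("[0-9]{1,8}"));
    }

    // Returns false for null values, as the question's isValid does.
    // Assumption: a column with no registered rule counts as valid.
    boolean isValid(String columnName, String value) {
        if (value == null) {
            return false;
        }
        Pattern rule = ruleByColumn.get(columnName);
        if (rule == null) {
            return true;
        }
        return rule.matcher(value).matches(); // whole value must match the pattern
    }
}
```

In the loop above, the value read with formatter.formatCellValue(row.getCell(requiredHeaders.get("serial"))) could then be checked with rules.isValid("serial", value), given a rule registered under that header name.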
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
Solution | Source |
---|---|
Solution 1 | |
Solution 2 | Gaspar |
Solution 3 | Abanoub Hany |