Try captioning on the examples below.
Input Picture | Number of Beams | Min-P | Top-P