Multipath issues with iSCSI storage
Setup:
- 1 RHEV-M (i7, 8 GB RAM, 2 x Gbit NIC)
- 2 Hypervisors (Conroe, 8 core, 24 GB RAM, 6 x Gbit NIC)
- Infortrend iSCSI storage 4 TB (4 disks), 4 x Gbit iSCSI channel
- Cisco Gbit switch (no trunking according to 802.3ad; at least our network guys did not succeed in configuring it)
- Since the switch doesn't support trunking, we run four subnets, one for each iSCSI channel.
- Each channel has a 1 TB LUN configured.
Problem:
When we configure a storage domain and add all 4 LUNs to the same (master) storage domain, the second hypervisor (not the SPM) loses its connection to the storage domain after 5 minutes. (Could not connect to storage domain attached to it)
I see strange multipath errors on the hypervisor console.
This behavior is reproducible (I did a fresh setup to reproduce it).
As far as I know it is not related to network configuration issues; we defined 4 logical storage networks in the cluster.
I can manually connect to the LUNs from each hypervisor using iscsiadm.
Solution:
Configuring the 4 LUNs in 4 separate domains works fine, and the second hypervisor doesn't lose its connection.
Question:
It would be nice to see our 4 TB storage in one domain instead of 4 domains holding 1 TB each.
Did I miss something in configuring the storage domains properly, or is this a known issue?
Responses
Sounds like you didn't configure the SAN properly. What you need to do is create a set of LUNs (one or more, up to you how many) and make them accessible through all four iSCSI portals (the Ethernet ports on the storage box).
Once you have that, simply discover and log in to every IP address when creating the storage domain. This should add up to 4 paths to the same LUN or LUNs.
Hope that makes sense; if not, I can try to elaborate.
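
To illustrate the difference between the two layouts, here is a minimal Python sketch that counts how many portals expose each LUN. The portal addresses and LUN names are made up for illustration (the thread does not give them), and the sketch only models the mapping; it does not talk to any storage.

```python
from collections import defaultdict

# Hypothetical portal addresses (one per subnet) and LUN names.
PORTALS = ["10.0.1.10", "10.0.2.10", "10.0.3.10", "10.0.4.10"]
LUNS = ["lun0", "lun1", "lun2", "lun3"]


def paths_per_lun(exports):
    """Count how many portals expose each LUN.

    `exports` maps a portal address to the list of LUNs it presents.
    """
    counts = defaultdict(int)
    for luns in exports.values():
        for lun in luns:
            counts[lun] += 1
    return dict(counts)


# Layout from the original post: each channel presents its own LUN.
one_lun_per_channel = {portal: [lun] for portal, lun in zip(PORTALS, LUNS)}

# Layout suggested in the reply: every LUN is presented on every portal.
all_luns_on_all_portals = {portal: list(LUNS) for portal in PORTALS}

if __name__ == "__main__":
    print("one LUN per channel:    ", paths_per_lun(one_lun_per_channel))
    print("all LUNs on all portals:", paths_per_lun(all_luns_on_all_portals))
```

In the first layout every LUN has exactly one path, so there is nothing for multipath to fail over to; in the second, every LUN has four.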
If I understood you correctly, all 4 LUNs have to be accessible through all four channels.
This would mean that logging in to the first portal shows the 4 LUNs, and logging in to the second (third, fourth) portal would show the same LUNs again, right?
If so, when I select the LUNs for the master storage domain, I see the 4 LUNs four times. Which checkboxes do I check? All 16, or just each LUN once?
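
For what it's worth, the 16 entries are 16 paths, not 16 disks. Below is a small Python sketch of how such a listing collapses when grouped by the LUN's unique identifier (WWID), which is how dm-multipath decides that several paths belong to one device. The portal addresses and WWID strings are hypothetical, and the sketch says nothing about which checkboxes the RHEV-M dialog expects.

```python
from collections import defaultdict
from itertools import product

# Hypothetical values; the thread does not list the real ones.
PORTALS = ["10.0.1.10", "10.0.2.10", "10.0.3.10", "10.0.4.10"]
WWIDS = ["wwid-lun0", "wwid-lun1", "wwid-lun2", "wwid-lun3"]

# Every portal shows every LUN: 4 portals x 4 LUNs = 16 listing rows.
rows = list(product(PORTALS, WWIDS))


def group_by_wwid(rows):
    """Group (portal, wwid) rows so that each WWID maps to its portals."""
    devices = defaultdict(list)
    for portal, wwid in rows:
        devices[wwid].append(portal)
    return dict(devices)


devices = group_by_wwid(rows)

if __name__ == "__main__":
    print(len(rows), "rows ->", len(devices), "devices")
    for wwid, portals in sorted(devices.items()):
        print(wwid, "reachable via", len(portals), "portals")
```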
It can be problematic; we are working to make it better. If they don't contain any important data, I would suggest using a free image hosting service and providing a link here.
If not, we can try to think of another solution.
Interesting! I have asked a colleague who has a large setup handy to try to reproduce this issue. Give me a few days to see what is going on.
This really sounds like a SAN issue, but that would require some extra investigation. Are you able to open a support case for this? I'm asking because such issues usually require lots of logs to be sent, which is beyond the scope of the UG issues here, and you probably wouldn't want to expose private information in a public forum.
My colleague is a bit busy, but he promised to try to reproduce your issue this week, if he has the time. If he fails to do so, the support guys will definitely be able to do the same in their own labs.
I have run into a similar problem at a customer install of RHEVM with an IBM DS3512 iSCSI/SAS subsystem. I could never figure out how to configure both target portal addresses in the storage domain. The way the DS3500 works is that there are two controllers (A/B), and they are active/active with four 1 Gb iSCSI host ports per controller. Each port pair across the two controllers is on a different subnet (A1/B1, A2/B2, A3/B3, A4/B4). Because the customer only has 2 GigE ports available to use, I was using the A1/B1 pair.
When I created the storage domain, I had the LUNs all owned by controller A and discovered that address. RHEVM never let me put in the second address. I guess I know now from the above that I should have been able to go in and edit the storage domain to discover the second address. But I swear I tried that and it didn't work.
I will have to get back down to that customer, try that, and see if I can get it to work.
James
When you create the SD, the flow is:
1. enter target IP
2. discover
3. login
4. pick LUN
If at this point, instead of clicking "OK", you repeat these steps with the next IP, you will add a path. Wash, rinse, repeat...
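
If you want to check the same thing by hand from a hypervisor, the discover/login loop can be sketched roughly like this. This is only a dry run: the Python below builds and prints the iscsiadm commands, it does not execute them. The portal addresses and the target IQN are hypothetical placeholders, and many arrays present a different IQN per portal, so take the real names from the discovery output. RHEV normally manages these sessions itself, so treat this as a troubleshooting aid only.

```python
import shlex

# Hypothetical values; replace with what your array actually reports.
PORTAL_IPS = ["10.0.1.10", "10.0.2.10", "10.0.3.10", "10.0.4.10"]
TARGET_IQN = "iqn.2000-01.com.example:storage.target0"


def portal_commands(portal_ip, target_iqn, port=3260):
    """Return the discovery and login commands for a single portal."""
    portal = f"{portal_ip}:{port}"
    discover = ["iscsiadm", "-m", "discovery", "-t", "sendtargets", "-p", portal]
    login = ["iscsiadm", "-m", "node", "-T", target_iqn, "-p", portal, "--login"]
    return [discover, login]


def build_plan(portal_ips, target_iqn):
    """Repeat discover + login for every portal, then list sessions and paths."""
    plan = []
    for ip in portal_ips:
        plan.extend(portal_commands(ip, target_iqn))
    plan.append(["iscsiadm", "-m", "session"])  # one session per portal expected
    plan.append(["multipath", "-ll"])           # shows the paths grouped per LUN
    return plan


if __name__ == "__main__":
    for cmd in build_plan(PORTAL_IPS, TARGET_IQN):
        print(shlex.join(cmd))
```

After the logins, each LUN should appear once in the multipath listing, with one path per portal underneath it.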
Dan, thanks for the comment. Am I able to go into the storage domain, edit it, and add a new target portal address? I think I tried that, and it would not let me select or log in to any other LUNs.
I'll try it again when I get a chance and take some screen shots.
Regards,
James