Troubleshooting AFS problems
If you have problems accessing your files in AFS, you can call the SCS Help Desk and ask them to take a look at the problem. If you wish to troubleshoot the problem yourself, there are a few things you can do.
Most AFS problems fall into one of the following categories:
- Permission/authentication problems
- Purely network problems
- Caching problems on your local machine
- AFS fileserver issues
Permission & authentication problems
The most common reason for "permission denied" errors is that one's AFS tokens have expired. You can use the tokens command to see the AFS tokens that you hold. If you do not have valid tokens, you can run kinit to authenticate and get new tokens. If you do have valid tokens, you should double-check that you are in the correct ACLs for the directories that you are trying to access. Note that if you are added to an AFS group, you will need to re-authenticate before you can use the permissions for that group.
If attempts to access a directory in AFS result in hangs or the directory not being found, you should first check to see if your machine is seeing the network. You can use the ping command to check network connectivity to other hosts (choose a host in the SCS machine room, like cs.cmu.edu). If you know which fileserver the directory is located on, you can ping that fileserver.
Local caching problems
A common cause of AFS problems is either a lack of disk space on the partition where a host's AFS cache is located or corruption of the local AFS cache. If you have trouble accessing files in AFS from a particular host, but can access these files from other hosts, then local AFS cache corruption or disk space issues may be the problem. See the section AFS issues and problems in the local Unix/Linux admin's guide for information on how to fix some of these local cache problems. Note that you will need root access on your machine to do this, and you should contact the Help Desk if you do not have root and/or are not comfortable fixing this type of problem.
AFS fileserver issues
The following commands can be useful in determining whether something is an AFS fileserver issue:
- /usr/local/bin/fs checkservers
- checks on the status of AFS fileservers that your local machine's cache manager has recently contacted.
- /usr/local/etc/vos examine <volume>
- reports on location and status of the given volume. Note that you will have to give the volume name (not the path to the volume). For example:
/usr/local/etc/vos examine user.bovikYou can get a volume name by using the fs lv <path> command. For example:
fs lv /afs/cs/user/bovikThe volume name for a user will always be of the form user.username.
- /usr/local/bin/fs checkvolumes
- will check the status of alternate locations of volumes for replicated collections. Doing so may fix some types of AFS access problems.
If you do notice a problem with an AFS server, please contact the SCS Help Desk to report it.